Transgenic Mammals Having Human Ig Loci Including Plural
VH and Vκ Regions and Antibodies Produced Therefrom



FIELD OF THE INVENTION 

The present invention relates to transgenic non-human animals that are engineered to contain 
human immunoglobulin gene loci. In particular, animals in accordance with the invention possess 
human Ig loci that include plural variable (VH and Vκ) gene regions. Advantageously, the inclusion
of plural variable region genes enhances the specificity and diversity of human antibodies produced
by the animal. Further, the inclusion of such regions enhances and reconstitutes B-cell development
in the animals, such that the animals possess abundant mature B-cells secreting extremely high affinity
antibodies.

BACKGROUND OF THE TECHNOLOGY

The ability to clone and reconstruct megabase-sized human loci in YACs and to introduce 
them into the mouse germline provides a powerful approach to elucidating the functional components 
of very large or crudely mapped loci as well as generating useful models of human disease. 
Furthermore, the utilization of such technology for substitution of mouse loci with their human 
equivalents could provide unique insights into the expression and regulation of human gene products 
during development, their communication with other systems, and their involvement in disease 
induction and progression. 



WO 98/24893



PCT/US97/23091 



An important practical application of such a strategy is the "humanization" of the mouse
humoral immune system. Introduction of human immunoglobulin (Ig) loci into mice in which the
endogenous Ig genes have been inactivated offers the opportunity to study the mechanisms
underlying programmed expression and assembly of antibodies as well as their role in B-cell
development. Furthermore, such a strategy could provide an ideal source for production of fully
human monoclonal antibodies (Mabs) - an important milestone towards fulfilling the promise of
antibody therapy in human disease. Fully human antibodies are expected to minimize the 
immunogenic and allergic responses intrinsic to mouse or mouse-derivatized Mabs and thus to 
increase the efficacy and safety of the administered antibodies. The use of fully human antibodies can 
be expected to provide a substantial advantage in the treatment of chronic and recurring human 
diseases, such as inflammation, autoimmunity, and cancer, which require repeated antibody 
administrations. 

One approach towards this goal was to engineer mouse strains deficient in mouse antibody 
production with large fragments of the human Ig loci in anticipation that such mice would produce 
a large repertoire of human antibodies in the absence of mouse antibodies. Large human Ig fragments 
would preserve the large variable gene diversity as well as the proper regulation of antibody 
production and expression. By exploiting the mouse machinery for antibody diversification and 
selection and the lack of immunological tolerance to human proteins, the reproduced human antibody 
repertoire in these mouse strains should yield high affinity antibodies against any antigen of interest, 
including human antigens. Using the hybridoma technology, antigen-specific human Mabs with the 
desired specificity could be readily produced and selected. 

This general strategy was demonstrated in connection with our generation of the first
XenoMouse™ strains as published in 1994. See Green et al., Nature Genetics 7:13-21 (1994). The
XenoMouse™ strains were engineered with 245 kb and 190 kb-sized germline configuration
fragments of the human heavy chain loci and kappa light chain loci, respectively, which contained
core variable and constant region sequences. Id. The human Ig containing yeast artificial
chromosomes (YACs) proved to be compatible with the mouse system for both rearrangement and
expression of antibodies, and were capable of substituting for the inactivated mouse Ig genes. This
was demonstrated by their ability to induce B-cell development and to produce an adult-like human
repertoire of fully human antibodies and to generate antigen-specific human Mabs. These results also
suggested that introduction of larger portions of the human Ig loci containing greater numbers of V 
genes, additional regulatory elements, and human Ig constant regions might recapitulate substantially 
the full repertoire that is characteristic of the human humoral response to infection and immunization.
Such approach is further discussed and delineated in U.S. Patent Application Serial Nos.
07/466,008, filed January 12, 1990, 07/610,515, filed November 8, 1990, 07/919,297, filed July 24,
1992, 07/922,649, filed July 30, 1992, 08/031,801, filed March 15, 1993, 08/112,848, filed
August 27, 1993, 08/234,145, filed April 28, 1994, 08/376,279, filed January 20, 1995, 08/430,938,
filed April 27, 1995, 08/464,584, filed June 5, 1995, 08/464,582, filed June 5, 1995, 08/463,191, filed June
5, 1995, 08/462,837, filed June 5, 1995, 08/486,853, filed June 5, 1995, 08/486,857, filed June 5,
1995, 08/486,859, filed June 5, 1995, 08/462,513, filed June 5, 1995, and 08/724,752, filed October
2, 1996. See also European Patent No. EP 0 463 151 B1, grant published June 12, 1996,
International Patent Application No. WO 94/02602, published February 3, 1994, International Patent
Application No. WO 96/34096, published October 31, 1996, and PCT Application No.
PCT/US96/05928, filed April 29, 1996. The disclosures of each of the above-cited patents and
applications are hereby incorporated by reference in their entirety.

In an alternative approach, others, including GenPharm International, Inc., have utilized a
"minilocus" approach. In the minilocus approach, an exogenous Ig locus is mimicked through the
inclusion of pieces (individual genes) from the Ig locus. Thus, one or more VH genes, one or more
DH genes, one or more JH genes, a mu constant region, and a second constant region (preferably a
gamma constant region) are formed into a construct for insertion into an animal. This approach is
described in U.S. Patent No. 5,545,807 to Surani et al. and U.S. Patent Nos. 5,545,806 and
5,625,825, both to Lonberg and Kay, and GenPharm International U.S. Patent Application Serial
Nos. 07/574,748, filed August 29, 1990, 07/575,962, filed August 31, 1990, 07/810,279, filed
December 17, 1991, 07/853,408, filed March 18, 1992, 07/904,068, filed June 23, 1992, 07/990,860,
filed December 16, 1992, 08/053,131, filed April 26, 1993, 08/096,762, filed July 22, 1993,
08/155,301, filed November 18, 1993, 08/161,739, filed December 3, 1993, 08/165,699, filed
December 10, 1993, 08/209,741, filed March 9, 1994, the disclosures of which are hereby
incorporated by reference. See also International Patent Application Nos. WO 94/25585, published
November 10, 1994, WO 93/12227, published June 24, 1993, WO 92/22645, published December
23, 1992, WO 92/03918, published March 19, 1992, the disclosures of which are hereby incorporated
by reference in their entirety. See further Taylor et al., 1992, Chen et al., 1993, Tuaillon et al., 1993,
Choi et al., 1993, Lonberg et al., (1994), Taylor et al., (1994), and Tuaillon et al., (1995), the
disclosures of which are hereby incorporated by reference in their entirety.

The inventors of Surani et al., cited above, and assigned to the Medical Research Council (the
"MRC"), produced a transgenic mouse possessing an Ig locus through use of the minilocus approach.
The inventors on the GenPharm International work, cited above, Lonberg and Kay, following the lead
of the present inventors, proposed inactivation of the endogenous mouse Ig locus coupled with
substantial duplication of the Surani et al. work.

An advantage of the minilocus approach is the rapidity with which constructs including 
portions of the Ig locus can be generated and introduced into animals. Commensurately, however, 
a significant disadvantage of the minilocus approach is that, in theory, insufficient diversity is 
introduced through the inclusion of small numbers of V, D, and J genes. Indeed, the published work 
appears to support this concern. B-cell development and antibody production of animals produced
through use of the minilocus approach appear stunted. Therefore, the present inventors have 
consistently urged introduction of large portions of the Ig locus in order to achieve greater diversity 
and in an effort to reconstitute the immune repertoire of the animals. 

Accordingly, it would be desirable to provide transgenic animals containing more complete 
germline sequences and configuration of the human Ig locus. It would be additionally desirable to
provide such locus against a knockout background of endogenous Ig. 

SUMMARY OF THE INVENTION

Provided in accordance with the present invention are transgenic animals having a near
complete human Ig locus, including both a human heavy chain locus and a human kappa light chain
locus. Preferably, the heavy chain locus includes greater than about 20%, more preferably greater 
than about 40%, more preferably greater than about 50%, and even more preferably greater than 
about 60% of the human heavy chain variable region. In connection with the human kappa light 
chain, preferably, the locus includes greater than about 20%, more preferably greater than about 40%, 
more preferably greater than about 50%, and even more preferably greater than about 60% of the
human kappa light chain variable region. Such percentages preferably refer to percentages of 
functional variable region genes. 

Further, preferably such animals include the entire DH region, the entire JH region, the human
mu constant region, and can additionally be equipped with genes encoding other human constant
regions for the generation of additional isotypes. Such isotypes can include genes encoding γ1, γ2,
γ3, α, ε, β, and other constant region encoding genes. Alternative constant regions can be included
on the same transgene, i.e., downstream from the human mu constant region, or, alternatively, such
other constant regions can be included on another chromosome. It will be appreciated that where
such other constant regions are included on the same chromosome as the chromosome including the
human mu constant region encoding transgene, cis-switching to the other isotype or isotypes can be
accomplished. On the other hand, where such other constant region is included on a different
chromosome from the chromosome containing the mu constant region encoding transgene,
trans-switching to the other isotype or isotypes can be accomplished. Such arrangement allows tremendous
flexibility in the design and construction of mice for the generation of antibodies to a wide array of
antigens.

Preferably, such mice additionally do not produce functional endogenous immunoglobulins. 
This is accomplished in a preferred embodiment through the inactivation (or knocking out) of 
endogenous heavy and light chain loci. For example, in a preferred embodiment, the mouse heavy 
chain J-region and mouse kappa light chain J-region and Cκ-region are inactivated through utilization
of homologous recombination vectors that replace or delete the region. Such techniques are 
described in detail in our earlier applications and publications. 

Unexpectedly, transgenic mice in accordance with the invention appear to possess an almost 
entirely reconstituted immune system repertoire. This is dramatically demonstrated when four 
separate mouse strains are compared: a first strain contains extensive human heavy chain variable 
regions and human kappa light chain variable regions and encodes only a mu isotype, a second strain 
contains extensive human heavy chain variable regions and human kappa light chain variable regions 
and encodes both mu and gamma-2 isotypes, a third strain contains significantly fewer human heavy and
kappa light chain variable regions, and a fourth strain contains a double-inactivated mouse Ig locus.
The first and second strains undergo similar, if not identical, B-cell development, whereas the third
strain has a reduced development and maturation of B-cells, and the fourth strain contains no mature
B-cells. Further, it is interesting to note that production of human antibodies in preference to mouse
antibodies is substantially elevated in mice having a knock-out background of endogenous Ig. That
is to say that mice that contain a human Ig locus and a functionally inactivated endogenous Ig
produce human antibodies approximately 100 to 1000 fold as efficiently as mice that
contain only a human Ig locus.

Thus, in accordance with a first aspect of the present invention there is provided a transgenic
non-human mammal having a genome, the genome comprising modifications, the modifications
comprising: an inactivated endogenous immunoglobulin (Ig) locus, such that the mammal would not
display normal B-cell development; an inserted human heavy chain Ig locus in substantially germline
configuration, the human heavy chain Ig locus comprising a human mu constant region and regulatory
and switch sequences thereto, a plurality of human JH genes, a plurality of human DH genes, and a
plurality of human VH genes; and an inserted human kappa light chain Ig locus in substantially
germline configuration, the human kappa light chain Ig locus comprising a human kappa constant
region, a plurality of Jκ genes, and a plurality of Vκ genes, wherein the number of VH and Vκ genes
inserted are selected to substantially restore normal B-cell development in the mammal. In a preferred
embodiment, the heavy chain Ig locus comprises a second constant region selected from the group
consisting of human gamma-1, human gamma-2, human gamma-3, human gamma-4, alpha, delta, and
epsilon. In another preferred embodiment, the number of VH genes is greater than about 20. In
another preferred embodiment, the number of Vκ genes is greater than about 15. In another
preferred embodiment, the number of DH genes is greater than about 25, the number of JH genes is
greater than about 4, the number of VH genes is greater than about 20, the number of Jκ genes is
greater than about 4, and the number of Vκ genes is greater than about 15. In another preferred
embodiment, the number of DH genes, the number of JH genes, the number of VH genes, the number
of Jκ genes, and the number of Vκ genes are selected such that the Ig loci are capable of encoding
greater than about 1 x 10^ different functional antibody sequence combinations. In a preferred
embodiment, in a population of mammals B-cell function is reconstituted on average to greater than
about 50% as compared to wild type.
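
The relationship between gene segment counts and the number of encodable sequence combinations can be illustrated with a simple count. The sketch below is only an illustration under stated assumptions: it multiplies germline segment counts (VH x DH x JH for the heavy chain, Vκ x Jκ for the kappa light chain) and ignores junctional diversity, N-additions, somatic hypermutation, and non-functional rearrangements, all of which alter the real figure. The function name is hypothetical, and the inputs are the lower bounds recited in the preferred embodiment above rather than measured values.

```python
def combinatorial_combinations(v_h, d_h, j_h, v_k, j_k):
    """Count germline segment pairings only (assumption: no junctional
    diversity, no somatic hypermutation, every pairing is functional)."""
    heavy = v_h * d_h * j_h   # V-D-J choices for the heavy chain
    light = v_k * j_k         # V-J choices for the kappa light chain
    return heavy * light      # each heavy chain can pair with each light chain


# Lower bounds from the preferred embodiment ("greater than about" each value).
lower_bounds = {"v_h": 20, "d_h": 25, "j_h": 4, "v_k": 15, "j_k": 4}
total = combinatorial_combinations(**lower_bounds)
print(total)  # 120000
```

With these lower bounds the count is 2,000 heavy chain combinations times 60 kappa light chain combinations. Larger segment counts raise the product multiplicatively.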

In accordance with a second aspect of the present invention there is provided an improved
transgenic non-human mammal having a genome that comprises modifications, the modifications
rendering the mammal capable of producing human immunoglobulin molecules but substantially
incapable of producing functional endogenous immunoglobulin molecules, the improvement
comprising: insertion into the genome of the mammal of sufficient human VH, DH, JH, Vκ, and Jκ
genes such that the mammal is capable of encoding greater than about 1x10^ different functional human
immunoglobulin sequence combinations.

In accordance with a third aspect of the present invention, there is provided an improved 
transgenic non-human mammal having a genome that comprises modifications, the modifications 
rendering the mammal capable of producing human immunoglobulin molecules but substantially 
incapable of producing functional endogenous immunoglobulin molecules, which modifications, with
respect to the mammal's incapacity to produce functional endogenous immunoglobulin molecules
would not allow the mammal to display normal B-cell development, the improvement comprising:
insertion into the genome of the mammal of sufficient human VH, DH, JH, Vκ, and Jκ genes such that
the mammal is capable of encoding greater than about 1 x 10* different functional human
immunoglobulin sequence combinations and sufficient VH and Vκ genes to substantially restore
normal B-cell development in the mammal. In a preferred embodiment, in a population of mammals
B-cell function is reconstituted on average to greater than about 50% as compared to wild type.

In accordance with a fourth aspect of the present invention, there is provided a transgenic 
non-human mammal having a genome, the genome comprising modifications, the modifications 
comprising: an inactivated endogenous heavy chain immunoglobulin (Ig) locus; an inactivated 
endogenous kappa light chain Ig locus; an inserted human heavy chain Ig locus, the human heavy 
chain Ig locus comprising a nucleotide sequence substantially corresponding to the nucleotide 
sequence of yH2; and an inserted human kappa light chain Ig locus, the human kappa light chain Ig 
locus comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

In accordance with a fifth aspect of the present invention there is provided a transgenic
non-human mammal having a genome, the genome comprising modifications, the modifications
comprising: an inactivated endogenous heavy chain immunoglobulin (Ig) locus; an inserted human
heavy chain Ig locus, the human heavy chain Ig locus comprising a nucleotide sequence substantially
corresponding to the nucleotide sequence of yH2; and an inserted human kappa light chain Ig locus,
the human kappa light chain Ig locus comprising a nucleotide sequence substantially corresponding
to the nucleotide sequence of yK2.

In accordance with a sixth aspect of the present invention, there is provided a transgenic non- 
human mammal having a genome, the genome comprising modifications, the modifications 
comprising: an inactivated endogenous heavy chain immunoglobulin (Ig) locus; an inactivated
endogenous kappa light chain Ig locus; an inserted human heavy chain Ig locus, the human heavy 
chain Ig locus comprising a nucleotide sequence substantially corresponding to the nucleotide 
sequence of yH2 without the presence of a human gamma-2 constant region; and an inserted human 
kappa light chain Ig locus, the human kappa light chain Ig locus comprising a nucleotide sequence 
substantially corresponding to the nucleotide sequence of yK2. 

In accordance with a seventh aspect of the present invention, there is provided a
transgenic non-human mammal having a genome, the genome comprising modifications, the
modifications comprising: an inactivated endogenous heavy chain immunoglobulin (Ig) locus; an
inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising a nucleotide 
sequence substantially corresponding to the nucleotide sequence of yH2 without the presence of a 
human gamma-2 constant region; and an inserted human kappa light chain Ig locus, the human kappa 
light chain Ig locus comprising a nucleotide sequence substantially corresponding to the nucleotide 
sequence of yK2. 

In accordance with an eighth aspect of the present invention, there is provided a method for 
the production of human antibodies, comprising: inoculating any of the mammals of the first through 
fifth aspects of the invention discussed above with an antigen; collecting and immortalizing 
lymphocytic cells to obtain an immortal cell population secreting human antibodies that specifically 
bind to the antigen with an affinity of greater than 10' M'*; and isolating the antibodies from the 
immortal cell populations. 

In a preferred embodiment, the antigen is IL-8. In another preferred embodiment, the antigen
is EGFR. In another preferred embodiment, the antigen is TNF-α.

In accordance with a ninth aspect of the present invention, there is provided an antibody 
produced by the method of the sixth aspect of the invention, including antibodies to IL-8, EGFR, and
TNF-α.

In accordance with a tenth aspect of the present invention, there is provided an improved 
method for the production of transgenic mice, the transgenic mice having a genome, the genome 
comprising modifications, the modifications comprising insertion of a plurality of human variable
regions, the improvement comprising: insertion of the human variable regions from a yeast artificial
chromosome. 

In accordance with an eleventh aspect of the present invention, there are provided transgenic 
mice and transgenic offspring therefrom produced through use of the improvement of the eighth 
aspect of the present invention.

In accordance with a twelfth aspect of the present invention, there is provided a transgenic 
mammal, the transgenic mammal comprising a genome, the genome comprising modifications, the 
modifications comprising an inserted human heavy chain immunoglobulin transgene, the improvement 
comprising: the transgene comprising selected sets of human variable region genes that enable 
human-like junctional diversity and human-like complementarity determining region 3 (CDR3)
lengths. In a preferred embodiment, the human-like junctional diversity comprises average N-addition 
lengths of 7.7 bases. In another preferred embodiment, the human-like CDR3 lengths comprise 
between about 2 through about 25 residues with an average of about 14 residues. 

BRIEF DESCRIPTION OF THE DRAWING FIGURES

Figure 1 is a schematic representation of the reconstructed human heavy chain and human
kappa light chain loci YACs introduced into preferred mice in accordance with the invention. YACs
spanning the human heavy chain (1H, 2H, 3H, and 4H) and the human kappa light chain proximal
(1K, 2K, and 3K) loci were cloned from human YAC libraries. The locations of the different YACs
with respect to the human Ig loci (adapted from Cook and Tomlinson, 1995, and Cox et al., 1994),
their sizes, and non-Ig sequences are indicated (not shown to scale). The YACs were recombined
into yeast in a two-step procedure (see Materials and Methods) to reconstruct the human heavy and
kappa light chain YACs. yH2, the human heavy chain containing YAC, was further retrofitted with
a human γ2 gene sequence. yK2 was the human kappa light chain containing YAC. The YAC vector
elements: telomere, centromere, mammalian (HPRT, Neo) and yeast selectable markers
(TRP1, ADE2, LYS2, LEU2, URA3, HIS3) on the YAC vector arms are indicated. VH segments
are classified as genes with open reading frames, pseudogenes (□), and unsequenced genes (○).
Vκ segments are classified as genes with open reading frames and pseudogenes (□). The V genes that
we have found to be utilized by the XenoMouse II are marked (*). The VH gene region contained
on yH2 is marked by arrows.

Figure 2 is a series of Southern Blot analyses and characterizations of the human heavy chain
YAC, yH2, integrated in ES cells and in XenoMouse strains. Figure 2a is a series of Southern Blot
analyses of EcoRI (a, c) and BamHI (b, d, e) digested DNA (2 μg) prepared from the CGM1
immortalized B-lymphoblast cell line derived from the Washington University YAC library source
(Brownstein et al., 1989), yH2 YAC (0.5 μg YAC added to 2 μg of 3B1 DNA), unmodified
E14TG.3B1 (3B1), and yH2-containing ES cell lines: L10, J9.2, L18, L17, and J17. The probes used
for blotting were human VH1 (a), DH (b) [18 kb fragment in CGM1 lane represents D segments on
chromosome 16], VH3 (c), Cμ (d) and JH (e). Figure 2b is a series of Southern Blot analyses of
EcoRI (a-b) and BamHI (c-d) digested DNA (10 μg) that was prepared from the tails of wildtype
(WT, 129xC57BL/6J), XM2A-1, and XM2A-2 (2 individual offspring) mice or from the parental
yH2-containing ES cell lines L10 (slightly underloaded relative to other samples), J9.2, and
yK2-containing ES cell line J23.1. The probes used were human VH1 (a), VH4 (b), human γ-2 (c),
and mouse 3'-enhancer (d, the 5 kb band represents the endogenous mouse 3'-enhancer fragment).
Fragment sizes of molecular weight markers (in kb) are indicated.

Figure 3 is a series of Southern Blot analyses characterizing the human kappa light chain
YAC, yK2, integrated in ES cells and in XenoMouse 2A strains. Figure 3a is a series of Southern
Blot analyses of EcoRI (a, c, d) and BamHI (b, e) digested DNA (2 μg) prepared from CGM1 cell
line (Brownstein et al., 1989, supra), yK2 YAC (0.5 μg YAC DNA added to 2 μg of 3B1 DNA),
unmodified E14TG.3B1 (3B1), and yK2-containing ES cell lines: J23.1 and J23.7. The probes used
were human VκI (a), Kde (b), VκII (c), VκIII (d), and Cκ (e). Figure 3b is a series of Southern Blot
analyses of EcoRI-digested DNA (2 μg) that was prepared from the tails of wildtype (WT, 129xB6),
XM2A-1, and XM2A-2 (2 individual offspring) mice or from the parental yH2-containing ES cell
lines L10 (slightly underloaded relative to other samples), J9.2, and yK2-containing ES cell line J23.1.
The probes that were used were human VκI (a), VκIV (b), VκVI (c) and 3'-enhancer (d). Fragment
sizes of molecular weight markers (in kb) are indicated.

Figure 4 shows B-cell reconstitution and surface expression of human μ, δ, and κ chains on
XenoMouse-derived B-cells and shows flow cytometry analysis of peripheral blood (Fig. 4a) and
spleen (Fig. 4b) lymphocytes from wildtype mice (WT), double inactivated mice (DI), and
XenoMouse strains 2A-1 and 2A-2 (XM2A-1, XM2A-2). Four-color flow cytometry analysis was
carried out using antibodies to the B-cell-specific marker B220 in combination with anti-human μ,
δ, κ, or mouse μ, δ, κ, or λ. The percentage of positively-stained cells is shown in each quadrant.
Isolation and staining of cells were performed as described in Materials and Methods. Populations
of human κ+ and mouse λ+ cells were determined after first gating for B220+μ+ populations in the
indicated region. Populations of μ+ and δ+ cells were determined after first gating for B220+ cells.
The percentage of positive cells within a region or quadrant is indicated. The FACS profiles shown
are representative of several experiments performed on each of the strains.

Figure 5 shows that XenoMouse-derived human antibodies block the binding of their specific
antigens to cells. Figure 5a shows the inhibition of labeled [125I]IL-8 binding to human neutrophils
by the mouse anti-human IL-8 antibody (R&D Systems) (□) and the fully human Mabs D1.1 (♦),
K2.2 (•), K4.2 (▲), and K4.3 (▼). The background binding of labeled [125I]IL-8 in the absence of
antibody was 2657 cpm. Figure 5b shows the inhibition of labeled [125I]EGF to its receptors on A431
cells by mouse anti-human EGFR antibodies 225 and 528 (□, ▽, respectively; Calbiochem) and the
fully human antibodies E1.1 (•), E2.4 (▲), E2.5 (▼) and E2.11 (♦). The background binding of
[125I]EGF in the absence of antibodies was 1060 cpm. Figure 5c shows inhibition of labeled [125I]
TNF-α binding to its receptors on U937 cells by the mouse anti-human TNF-α antibody (R&D
Systems) (□) and fully human Mabs T22.1 (♦), T22.4 (•), T22.8 (▲), and T22.9 (■). The
background binding of [125I]TNF-α in the absence of antibody was 4010 cpm. Control human IgG2
myeloma antibody (a).

Figure 6 shows repertoire and somatic hypermutation in XenoMouse-derived fully human
Mabs. Predicted amino acid sequences of four anti-IL-8 (Fig. 6a) and four anti-EGFR (Fig. 6b)
human IgG2κ Mabs, divided into CDR1, CDR2 and CDR3 and the constant regions, Cγ2 and Cκ. The
D and J genes of each antibody are indicated. The amino acid substitutions from the germline
sequences are indicated in bold letters.

Figure 7 is a schematic diagram of the human heavy chain genome and the human kappa light 
chain genome. 

Figure 8 is another schematic diagram showing the construction of the yH2 (human heavy 
chain) YAC. 

Figure 9 is another schematic diagram showing the construction of the yK2 (human kappa 
light chain) YAC. 

Figure 10 is another schematic diagram showing the construction of the yK2 (human kappa 
light chain) YAC. 

Figure 11 shows a series of Southern Blot analyses demonstrating intact integration of the
yH2 (human heavy chain) YAC into ES cells and into the mouse genome. Detailed discussion is 
provided in connection with Figure 2. 

Figure 12 shows a series of Southern Blot analyses demonstrating intact integration of the
yK2 (human kappa light chain) YAC into ES cells and into the mouse genome. Detailed discussion 
is provided in connection with Figure 3. 

Figure 13 shows B-cell reconstitution and surface expression of human μ, δ, and κ chains and
mouse X chains on XenoMouse-derived B-cells and shows flow cytometry analysis of peripheral 
blood. Further details are provided in connection with Figure 4. 

Figure 14 shows production levels of human antibodies by XenoMouse II strains in 
comparison to murine antibody production by wild type mice. 

Figure 15 is a repertoire analysis of human heavy chain transcripts expressed in XenoMouse 
II strains. 

Figure 16 is a repertoire analysis of human kappa light chain transcripts expressed in 
XenoMouse II strains. 

Figure 17 is another depiction of the diverse utilization of human VH and Vκ genes that have
been observed as utilized in XenoMouse II strains.

Figure 18 shows the titers of human antibody production in XenoMouse II strains.

Figure 19 is a depiction of gene utilization of anti-IL-8 antibodies derived from XenoMouse
II strains.

Figure 20 shows heavy chain amino acid sequences of anti-IL-8 antibodies derived from
XenoMouse II strains.

Figure 21 shows kappa light chain amino acid sequences of anti-IL-8 antibodies derived from 
XenoMouse II strains.

Figure 22 shows blockage of IL-8 binding to human neutrophils by human anti-IL-8 
antibodies derived from XenoMouse II strains. 

Figure 23 shows inhibition of CD11b expression on human neutrophils by human anti-IL-8
antibodies derived from XenoMouse II strains. 

Figure 24 shows inhibition of IL-8 induced calcium influx by human anti-IL-8 antibodies
derived from XenoMouse II strains. 

Figure 25 shows inhibition of IL-8 RB/293 chemotaxis by human anti-IL-8 antibodies
derived from XenoMouse II strains. 

Figure 26 is a schematic diagram of a rabbit model of human IL-8 induced skin inflammation.

Figure 27 shows the inhibition of human IL-8 induced skin inflammation in the rabbit model 
of Figure 26 with human anti-IL-8 antibodies derived from XenoMouse II strains. 

Figure 28 shows inhibition of angiogenesis of endothelial cells on a rat corneal pocket model 
by human anti-IL-8 antibodies derived from XenoMouse II strains. 

Figure 29 is a depiction of gene utilization of human anti-EGFR antibodies derived from 
XenoMouse II strains. 

Figure 30 shows heavy chain amino acid sequences of human anti-EGFR antibodies derived 
from XenoMouse II strains. 

Figure 31 shows blockage of EGF binding to A431 cells by human anti-EGFR antibodies
derived from XenoMouse II strains. 

Figure 32 shows inhibition of EGF binding to SW948 cells by human anti-EGFR antibodies 
derived from XenoMouse II strains. 

Figure 33 shows that human anti-EGFR antibodies derived from XenoMouse II strains inhibit 
growth of SW948 cells in vitro. 

Figure 34 shows inhibition of TNF-a binding to U937 cells through use of human anti-TNF-a 
antibodies derived from XenoMouse II strains. 

Figure 35 shows kappa light chain amino acid sequences of human anti-EGFR antibodies
derived from XenoMouse II strains. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Herein we describe the generation and characterization of several strains of mice containing 
substantially germline configuration megabase-sized human Ig loci. The present invention thus 
provides the first demonstration of reconstruction of the large and complex human Ig loci on YACs 
and the successful introduction of megabase-sized YACs into mice to functionally replace the 
corresponding mouse loci. 

Mouse Strains 

The following mouse strains are described and/or utilized herein: 

Double Inactivated (DI) Strain: DI mice are mice that do not produce functional endogenous 
mouse Ig. In preferred embodiments, the DI mice possess an inactivated mouse JH region and an 
inactivated mouse Cκ region. The construction of this strain is discussed extensively elsewhere. 
For example, the techniques utilized for generation of the DI strains are described in detail in 
U.S. Patent Application Serial Nos. 07/466,008, filed January 12, 1990, 07/610,515, filed 
November 8, 1990, 07/919,297, filed July 24, 1992, 08/031,801, filed March 15, 1993, 08/112,848, 
filed August 27, 1993, 08/234,145, filed April 28, 1994, and 08/724,752, filed October 2, 1996. 
See also European Patent No. EP 0 463 151 B1, grant published June 12, 1996, International 
Patent Application No. WO 94/02602, published February 3, 1994, International Patent Application 
No. WO 96/34096, published October 31, 1996, and PCT Application No. PCT/US96/05928, filed 
April 29, 1996. The disclosures of each of the above-cited patents and patent applications are 
hereby incorporated by reference in their entirety. It has been observed and reported that DI 
mice exhibit very immature B-cell development. The mice do not produce mature B-cells, only 
pro-B-cells. 

XenoMouse I Strain: The design, construction, and analysis of the XenoMouse I strain were 
discussed in detail in Green et al., Nature Genetics, 7:13-21 (1994). Such mice produced IgMκ 
antibodies against a DI background. The mice showed improved B-cell function when compared to 
the DI strain of mice, which have little to no B-cell development. While XenoMouse I strains of 
mice were capable of mounting a sizeable immune response to antigenic challenge, they appeared 
to be inefficient in their production of B-cells and possessed a limited response to different 
antigens, which apparently was related to their limited V-gene repertoire. 

L6 Strain: The L6 strain is a mouse producing IgMκ antibodies against a DI background 
of endogenous mouse Ig. L6 mice contain an inserted human heavy chain and an inserted human 
kappa light chain. The L6 strain is generated through breeding of a mouse containing a heavy chain 
insert against a double inactivated background (L6H) and a mouse having a kappa light chain insert 
against a double inactivated background (L6L). The heavy chain insert comprises an intact 
approximately 970 kb human DNA insert from a YAC containing approximately 66 VH segments, 
starting at VH6-1 and ending at VH3-65, and including the major D gene clusters (approximately 32), 
JH genes (6), the intronic enhancer (Eμ), Cμ, and through about 25 kb past Cδ, in germline 
configuration. The light chain insert comprises an intact approximately 800 kb human DNA insert 
from a YAC which contains approximately 32 Vκ genes starting at Vκ-B3 and ending at Vκ-Op11. The 
800 kb insert contains a deletion of approximately 100 kb starting at [...] and ending at [...]. 
However, the DNA is in germline configuration from Vκ-Lp-13 to 100 kb past Vκ-Op-1, and also 
contains the Jκ genes, the intronic and 3' enhancers, the constant Cκ gene, and Kde. The L6H and 
L6L mice 
have been shown to access the full spectrum of the variable genes incorporated into their genome. 
It is expected that the L6 mice will similarly access the full spectrum of variable genes in their 
genome. Furthermore, L6 mice will exhibit predominant expression of human kappa light chain, a 
large population of mature B-cells, and normal levels of IgMκ human antibodies. Such mice will 
mount a vigorous human antibody response to multiple immunogens, ultimately yielding 
antigen-specific fully human Mabs with subnanomolar affinities. 

XenoMouse IIa Strain: The XenoMouse IIa mice represent our second generation 
XenoMouse™ strains equipped with germline configuration megabase-sized human Ig loci, against 
a DI background, such that the mice do not produce functional endogenous Ig. Essentially, the mice 
are equivalent in construction to the L6 strain, but additionally include the human γ2 gene with its 
entire switch and regulatory sequences and the mouse 3' enhancer in cis. The mice contain an 
approximately 1020 kb heavy chain locus and an approximately 800 kb kappa light chain locus, 
reconstructed on YACs, which include the majority of the human variable region genes, including 
heavy chain genes (approximately 66 VH) and kappa light chain genes (approximately 32 Vκ), human 
heavy constant region genes (μ, δ, and γ) and kappa constant region genes (Cκ), and all of the 
major identified regulatory elements. These mice have been shown to access the full spectrum of 
the variable genes incorporated into their genome. Furthermore, they exhibit efficient class 
switching and somatic hypermutation, predominant expression of human kappa light chain, a large 
population of mature B-cells, and normal levels of IgMκ and IgGκ human antibodies. Such mice mount 
a vigorous human antibody response to multiple immunogens, including human IL-8, human EGF 
receptor (EGFR), and human tumor necrosis factor-α (TNF-α), ultimately yielding antigen-specific 
fully human Mabs with subnanomolar affinities. This last result conclusively demonstrates 
XenoMouse™ as an excellent source for rapid isolation of high affinity, fully human therapeutic 
Mabs against a broad spectrum of antigens with any desired specificity. 

As will be appreciated from the above introduction, the XenoMouse II strain appears to 
undergo mature B-cell development and mount powerful adult-human-like immune responses to 
antigenic challenge. The L6 strain, as predicted from the data in connection with L6L and L6H mice, 
also appears to undergo mature B-cell development and mount powerful adult-human-like immune 
responses to antigenic challenge. When DI mice are compared to XenoMouse I strains and DI and 
XenoMouse I strains are compared to L6 and XenoMouse II strains, a markedly different B-cell 
development profile is observed. Owing to this difference, it appears that the quantity and/or quality 
of variable region sequences introduced into the animals are essential to the induction of B-cell 
maturation and development and the generation of an adult-human-like immune response. Thus, in 
addition to the strains' clear use in the generation of human antibodies, the strains provide a valuable 
tool for studying the nature of human antibodies in the normal immune response, as well as the 
abnormal response characteristic of autoimmune disease and other disorders. 

Variable Region - Quantitative Diversity 

It is predicted that the specificity of antibodies (i.e., the ability to generate antibodies to a wide 
spectrum of antigens and indeed to a wide spectrum of independent epitopes thereon) is dependent 
upon the variable region genes on the heavy chain (VH) and kappa light chain (Vκ) genome. The 
human heavy chain genome includes approximately 95 functional genes which encode variable regions 
of the human heavy chain of immunoglobulin molecules. In addition, the human light chain genome 
includes approximately 40 genes on its proximal end which encode variable regions of the human 
kappa light chain of immunoglobulin molecules. We have demonstrated that the specificity of 
antibodies can be enhanced through the inclusion of a plurality of genes encoding variable light and 
heavy chains. 

Provided in accordance with the present invention are transgenic mice having a substantial 
portion of the human Ig locus, preferably including both a human heavy chain locus and a human 
kappa light chain locus. In preferred embodiments, therefore, greater than 10% of the human VH and 
Vκ genes are utilized. More preferably, greater than about 20%, 30%, 40%, 50%, 60%, or even 70% 
or greater of VH and Vκ genes are utilized. In a preferred embodiment, constructs including 32 genes 
on the proximal region of the Vκ light chain genome are utilized and 66 genes on the VH portion of 
the genome are utilized. As will be appreciated, genes may be included either sequentially, i.e., in the 
order found in the human genome, or out of sequence, i.e., in an order other than that found in the 
human genome, or a combination thereof. Thus, by way of example, an entirely sequential portion 
of either the VH or Vκ genome can be utilized, or various V genes in either the VH or Vκ genome can 
be skipped while maintaining an overall sequential arrangement, or V genes within either the VH or 
Vκ genome can be reordered, and the like. In a preferred embodiment, the entire inserted locus is 
provided in substantially germline configuration as found in humans. In any case, it is expected and 
the results described herein demonstrate that the inclusion of a diverse array of genes from the VH and 
Vκ genome leads to enhanced antibody specificity and ultimately to enhanced antibody affinities. 
Further, preferably such mice include the entire DH region, the entire JH region, the human mu 
constant region, and can additionally be equipped with other human constant regions for the coding 
and generation of additional isotypes of antibodies. Such isotypes can include genes encoding γ1, γ2, 
γ3, γ4, α, ε, and other constant region encoding genes with appropriate switch and regulatory 
sequences. As will be appreciated, and as discussed in more detail below, a variety of switch and 
regulatory sequences can be appropriately utilized in connection with any particular constant region 
selection. 
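
By way of illustration only, the proportion of the human V-gene repertoire represented by the preferred construct can be checked with simple arithmetic. The following sketch assumes the approximate totals recited above (about 95 functional VH genes and about 40 proximal Vκ genes) and the 66 VH and 32 Vκ genes of the preferred embodiment. The function name is hypothetical and chosen for this example.

```python
def fraction_included(included: int, total: int) -> float:
    """Return the fraction of a V-gene repertoire carried by a construct."""
    if total <= 0:
        raise ValueError("total must be positive")
    return included / total


# Approximate gene counts taken from the text; the totals are estimates.
vh_fraction = fraction_included(66, 95)  # heavy chain V genes
vk_fraction = fraction_included(32, 40)  # kappa light chain V genes

print(f"VH: {vh_fraction:.1%}, Vk: {vk_fraction:.1%}")  # VH: 69.5%, Vk: 80.0%
```

On these assumed totals, both loci fall well above the 10% threshold recited for the preferred embodiments.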

The following Table indicates the diversity of antibody combinations that are possible in 
humans, based strictly on random V-D-J joining and combination with kappa light chains, without 
consideration of N-addition or somatic mutation events. Based on these considerations, there are 
greater than 3.8 million possible antibody combinations in humans, of any particular isotype. 



TABLE I 

Region                      Heavy Chain    Kappa Light Chain 
Variable "V"                ~95            40 
Diversity "D"               ≥32 
Joining "J"                 6              5 
Combinations (VxDxJ)        18,240         200 
Total Combinations          3.65 X 10^6 
(HC Combinations x LC 
Combinations) 

In connection with a preferred embodiment of the invention, through the inclusion of about 
66 VH genes and 32 Vκ genes in a mouse with a full complement of DH, JH, and Jκ genes, the 
possible diversity of antibody production is on the order of 2.03 X 10^6 different antibodies. As 
before, such calculation does not take into account N-addition or somatic mutation events. 
Therefore, it will be appreciated that mice in accordance with the invention, such as the L6 and the 
XenoMouse II strains, offer substantial antibody diversity. In preferred embodiments, mice are 
designed to have the capability of producing greater than 1 X 10^6 different heavy chain V-D-J 
combinations and kappa light chain V-J combinations, without accounting for N-additions or somatic 
mutation events. 
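
The combinatorial figures given above follow from multiplying the numbers of V, D, and J segments for each chain and then multiplying the heavy and light chain totals. A minimal sketch of that arithmetic is set out below, assuming the approximate segment counts recited in the text and ignoring N-addition and somatic mutation, as the text does. The function and variable names are illustrative only.

```python
def chain_combinations(v: int, j: int, d: int = 1) -> int:
    """Number of V-(D)-J combinations for one chain; d defaults to 1 for light chains."""
    return v * d * j


# Human repertoire, using the approximate counts of Table I.
human_heavy = chain_combinations(v=95, d=32, j=6)   # 18,240
human_kappa = chain_combinations(v=40, j=5)         # 200
human_total = human_heavy * human_kappa             # 3,648,000 (about 3.65 X 10^6)

# Preferred embodiment: about 66 VH and 32 Vk genes with full D and J complements.
mouse_heavy = chain_combinations(v=66, d=32, j=6)   # 12,672
mouse_kappa = chain_combinations(v=32, j=5)         # 160
mouse_total = mouse_heavy * mouse_kappa             # 2,027,520 (about 2.03 X 10^6)

print(human_total, mouse_total)
```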
In addition to quantitative diversity, quantitative selection of V-genes (i.e., large and diverse 
numbers of V-genes) and/or qualitative selection of V-genes (i.e., selection of particular V-genes) 
appears to play a role in what we refer to herein as "qualitative diversity." Qualitative diversity, as 
used herein, refers to diversity in V-D-J rearrangements wherein junctional diversity and/or somatic 
mutation events are introduced. During heavy chain rearrangement, certain enzymes (RAG-1, RAG-2, 
and possibly others) are responsible for the cutting of the DNA representing the coding regions of 
the antibody genes. Terminal deoxynucleotidyl transferase (Tdt) activity, which is responsible for 
N-terminal additions of nucleotides between the V-D and D-J gene segments, is upregulated. Similar 
enzymes and others (SCID and other DNA repair enzymes) are responsible for the deletion that 
occurs at the junctions of these coding segments. With respect to junctional diversity, both 
N-addition events and formation of the complementarity determining region 3 (CDR3) are included 
within such term. As will be appreciated, CDR3 is located across the D region and includes the V-D 
and D-J junctional events. Thus, N-additions and deletions during both D-J rearrangement and V-D 
rearrangement are responsible for CDR3 diversity. 

It has been demonstrated that there are certain differences between murine and human 
junctional diversities. In particular, some researchers have reported that murine N-addition lengths 
and CDR3 lengths are generally shorter than typical human N-addition lengths and CDR3 lengths. 
Such groups have reported that, in humans, N-additions of about 7.7 bases in length, on average, are 
typically observed. Yamada et al. (1991). Mouse-like N-additions are more often on the order of 
about 3 bases in length, on average. Feeney et al. (1990). Similarly, human-like CDR3 lengths are 
longer than mouse-like CDR3s. In man, CDR3 lengths of between 2 and 25 residues, with an average 
of 14 residues, are common. In mice, some groups have reported shorter average CDR3 lengths. 

The junctional diversity created by N-additions and CDR3 additions plays a clear role in 
developing antibody specificity. 

In accordance with the invention, rearranged V-D-J gene sequences show N-addition lengths 
that are comparable to expected adult-human N-addition lengths. Further, amino acid sequences 
across the open reading frame (ORF) corresponding to CDR3 sequences show CDR3 lengths that 
are comparable to expected adult-human CDR3 lengths. Such data are indicative that quantitative 
variable region diversity and/or qualitative variable region diversity results in human-like junctional 
diversity. Such junctional diversity is expected to lead to a more human-like antibody specificity. 

Variable Region - Affinities 

While we have not conclusively demonstrated a direct causal connection between the 
increased variable region inclusion and antibody specificity, it appears, and it is expected, that through 
providing such diversity, the ability of the mouse to mount an immune response to a wide array of 
antigens is enhanced. Additionally, such mice appear more equipped to mount immune 
responses to a wide array of epitopes upon individual antigens or immunogens. From our data it also 
appears that antibodies produced in accordance with the present invention possess enhanced affinities. 
Such data includes comparisons between mice in accordance with the invention and the XenoMouse 
I strains, as well as consideration of the published results of GenPharm International and the MRC. 
In connection with the XenoMouse I strains, as mentioned above, such mice possessed inefficient B- 
cell production and a limited response to different antigens. Such result appeared related in part to 
the limited V-gene repertoire. Similarly, results reported by GenPharm International and the MRC 
indicate a limited response to diverse antigens. 

Without wishing to be bound to any particular theory or mode of operation of the invention, it 
would appear that enhanced affinities result from the provision of the large number of V 
regions. From our data, the provision of greater numbers and/or selection of qualities of V-gene 
sequences enhances junctional diversity (N-additions and formation of complementarity determining 
region 3 ("CDR3") diversity), which is typical of an adult-human-like immune response, and which 
plays a substantial role in affinity maturation of antibodies. It may also be that such antibodies are 
more effective and efficient in somatic mutation events that lead to enhanced affinities. Each of 
junctional diversity and somatic mutation events is discussed in additional detail below. 

With respect to affinities, antibody affinity rates and constants derived through utilization of 
plural VH and Vκ genes (i.e., the use of 32 genes on the proximal region of the Vκ light chain genome 
and 66 genes on the VH portion of the genome) result in association rates (ka in M^-1 s^-1) of greater 
than about 0.50 X 10^[...], preferably greater than 2.00 X 10^[...], and more preferably greater than 
about 4.00 X 10^[...]; dissociation rates (kd in s^-1) of greater than about 1.00 X 10^[...], preferably 
greater than about 2.00 X 10^[...], and more preferably greater than about 4.00 X 10^[...]; and 
dissociation constant (in M) of greater than about 1.00 X 10^-10, preferably greater than about 
2.00 X 10^-10, and more preferably greater than about 4.00 X 10^-10. 
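
For orientation, the three quantities recited above are linked by the standard kinetic relationship in which the equilibrium dissociation constant equals the dissociation rate divided by the association rate (KD = kd / ka). The sketch below illustrates that relationship only. The numerical inputs are hypothetical values chosen for the example and are not measurements reported herein.

```python
def dissociation_constant(ka: float, kd: float) -> float:
    """Equilibrium dissociation constant KD (in M) from ka (M^-1 s^-1) and kd (s^-1)."""
    if ka <= 0:
        raise ValueError("association rate must be positive")
    if kd < 0:
        raise ValueError("dissociation rate cannot be negative")
    return kd / ka


# Hypothetical example values, not taken from the specification.
example_ka = 1.0e6   # M^-1 s^-1
example_kd = 1.0e-4  # s^-1
example_KD = dissociation_constant(example_ka, example_kd)

print(f"KD = {example_KD:.2e} M")
```

A smaller KD corresponds to tighter binding, so a subnanomolar affinity means a KD below 1 X 10^-9 M.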

Preferably, such mice additionally do not produce functional endogenous immunoglobulins. 
This is accomplished in a preferred embodiment through the inactivation (or knocking out) of 
endogenous heavy and light chain loci. For example, in a preferred embodiment, the mouse heavy 
chain J-region and mouse kappa light chain J-region and Cκ-region are inactivated through utilization 
of homologous recombination vectors that replace or delete the region. 

Variable Region - B-cell Development 

B-cell development is reviewed in Klaus, B Lymphocytes (IRL Press (1990)) and Chapters 1-3 
of Immunoglobulin Genes (Academic Press Ltd. (1989)), the disclosures of which are hereby 
incorporated by reference. Generally, in mammals, blood cells, including B- and T-cell 
lymphocytes, originate from a common pluripotent stem cell. The lymphocytes, then, evolve from 
a common lymphoid progenitor cell. Following an early gestational period, B-cell initiation shifts 
from the liver to the bone marrow where it remains throughout the life of the mammal. 

In the life cycle of a B-cell, the first generally recognizable cell is a pro-pre-B-cell which is 
found in the bone marrow. Such a cell has begun heavy chain V-D-J rearrangement, but does not yet 
make protein. The cell then evolves into a large, rapidly dividing, pre-B-cell I which is a 
cytoplasmically μ+ cell. This pre-B-cell I then stops dividing, shrinks, and undergoes light chain V-J 
rearrangement, becoming a pre-B-cell II which expresses surface IgM. Such cells leave the marrow as 
immature B-cells. Most of the emerging immature B-cells continue to develop and to produce 
surface IgD, indicative of their completion of differentiation and development as fully mature 
immunocompetent peripheral B-cells, which reside primarily in the spleen. However, it is possible 
to eliminate the delta constant region and still obtain immunocompetent cells. 

B-cell differentiation and development can be monitored and/or tracked through the use of 
surface markers. For example, the B220 antigen is expressed in relative abundance on mature B-cells 
in comparison to pre-B-cells I or II. Thus, cells that are B220+ and surface IgM+ (μ+) can be utilized 
to determine the presence of mature B-cells. Additionally, cells can be screened for surface IgD 
expression (δ+). Another antigen, heat stable antigen, is expressed by pre-B-cells II as they transition 
to the periphery (i.e., as they become μ+ and/or μ+, δ+). 



TABLE II 

Bone Marrow 

Marker   pro-pre-B-cell   pre-B-cell I   emerging B-cell   immature B-cell   mature B-cell 
B220                                     ±                                   ++ 
HSA                                                        ±                   
δ*                                                                           + 

* Assuming the presence of a functional copy of the Cδ gene on the transgene. 



Through use of B-cell markers, such as those mentioned above, development and 
differentiation of B-cells can be monitored and assessed. 
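
The marker-based assessment described above can be sketched as a simple classification rule. The following example is a deliberate simplification that treats each marker as present or absent and flags a cell as a mature B-cell candidate when it is B220+ and surface IgM+, with surface IgD recorded as an additional marker. Actual flow cytometry relies on fluorescence intensities and gating, and the class and function names here are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class CellMarkers:
    """Simplified present/absent surface marker profile for one cell."""
    b220: bool
    surface_igm: bool
    surface_igd: bool


def is_mature_candidate(cell: CellMarkers) -> bool:
    """B220+ and surface IgM+ cells are counted as mature B-cell candidates."""
    return cell.b220 and cell.surface_igm


def mature_fraction(cells: Iterable[CellMarkers]) -> float:
    """Fraction of cells in a sample that are mature B-cell candidates."""
    cells = list(cells)
    if not cells:
        return 0.0
    return sum(is_mature_candidate(c) for c in cells) / len(cells)


# Hypothetical sample of four cells, for illustration only.
sample = [
    CellMarkers(b220=True, surface_igm=True, surface_igd=True),
    CellMarkers(b220=True, surface_igm=True, surface_igd=False),
    CellMarkers(b220=True, surface_igm=False, surface_igd=False),
    CellMarkers(b220=False, surface_igm=False, surface_igd=False),
]
fraction = mature_fraction(sample)
print(fraction)  # 0.5
```

Comparing such a fraction between a transgenic population and a wild type control gives one rough measure of B-cell reconstitution.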

We have previously demonstrated that DI mice (mice that do not undergo heavy chain V-D-J 
rearrangement or light chain V-J rearrangement) do not produce mature B-cells. In fact, such mice 
arrest at the production of pro-pre-B-cells and B-cells never move from the bone marrow to 
peripheral tissues, including the spleen. Thus, both B-cell development and antibody production are 
completely arrested. The same result is seen in mice that are only heavy chain inactivated; B-cell 
development and differentiation arrests in the bone marrow. 

Our XenoMouse I strain produced functional, somewhat mature B-cells. However, the 
numbers of B-cells, in both the bone marrow and peripheral tissues, were significantly reduced 
relative to wild type mice. 

In contrast, our XenoMouse II strains and L6 strains unexpectedly possess almost complete 
B-cell reconstitution. Therefore, in accordance with the invention, we have demonstrated that 
through the quantitative inclusion or qualitative inclusion of variable region genes, B-cell 
differentiation and development can be greatly reconstituted. Reconstitution of B-cell differentiation 
and development is indicative of immune system reconstitution. In general, B-cell reconstitution is 
compared to wild type controls. Thus, in preferred embodiments of the invention, populations of 
mice having inserted human variable regions possess greater than about 50% B-cell function when 
compared to populations of wild type mice. 

Further, it is interesting to note that production of human antibodies in preference to mouse 
antibodies is substantially elevated in mice having a knock-out background of endogenous Ig. That 
is to say that mice that contain a human Ig locus and a functionally inactivated endogenous heavy 
chain Ig locus produce human antibodies approximately 100 to 1000 fold more efficiently 
than mice that only contain a human Ig locus and are not inactivated for the endogenous locus. 

Isotype Switching 

As is discussed in detail herein, as expected, XenoMouse II mice undergo efficient and 
effective isotype switching from the human transgene encoded mu isotype to the transgene encoded 
gamma-2 isotype. We have also developed XenoMouse II strains that contain and encode the human 
gamma-4 constant region. As mentioned above, mice in accordance with the invention can 
additionally be equipped with other human constant regions for the generation of additional isotypes. 
Such isotypes can include genes encoding γ1, γ2, γ3, γ4, α, ε, δ, and other constant region encoding 
genes. Alternative constant regions can be included on the same transgene, i.e., downstream from 
the human mu constant region, or, alternatively, such other constant regions can be included on 
another chromosome. It will be appreciated that where such other constant regions are included on 
the same chromosome as the chromosome including the human mu constant region encoding 
transgene, cis-switching to the other isotype or isotypes can be accomplished. On the other hand, 
where such other constant region is included on a different chromosome from the chromosome 
containing the mu constant region encoding transgene, trans-switching to the other isotype or 
isotypes can be accomplished. Such arrangement allows tremendous flexibility in the design and 
construction of mice for the generation of antibodies to a wide array of antigens. 

It will be appreciated that constant regions have known switch and regulatory sequences that 
they are associated with. All of the murine and human constant region genes had been sequenced 
and published by 1989. See Honjo et al. "Constant Region Genes of the Immunoglobulin Heavy 
Chain and the Molecular Mechanism of Class Switching" in Immunoglobulin Genes (Honjo et al. 
eds., Academic Press (1989)), the disclosure of which is hereby incorporated by reference. For 
example, in U.S. Patent Application Serial No. 07/574,748, the disclosure of which is hereby 
incorporated by reference, the cloning of the human gamma-1 constant region was prophesized based 
on known sequence information from the prior art. It was set forth that in the unrearranged, 
unswitched gene, the entire switch region was included in a sequence beginning less than 5 kb from 
the 5' end of the first γ-1 constant exon. Therefore the switch region was also included in the 5' 5.3 
kb HindIII fragment that was disclosed in Ellison et al. Nucleic Acids Res. 10:4071-4079 (1982). 
Similarly, Takahashi et al. Cell 29:671-679 (1982) also reported that the fragment disclosed in Ellison 
contained the switch sequence, and this fragment together with the 7.7 kb HindIII to BamHI fragment 
must include all of the sequences necessary for the heavy chain isotype switching transgene 
construction. 

Thus, it will be appreciated that any human constant region of choice can be readily 
incorporated into mice in accordance with the invention without undue experimentation. Such 
constant regions can be associated with their native switch sequences (i.e., a human γ1, 2, 3, or 4 
constant region with a human γ1, 2, 3, or 4 switch, respectively) or can be associated with other 
switch sequences (i.e., a human γ4 constant region with a human γ2 switch). Various 3' enhancer 
sequences can also be utilized, such as mouse, human, or rat, to name a few. Similarly other 
regulatory sequences can 
also be included. 

As an alternative to, and/or in addition to, isotype switching in vivo, B-cells can be screened 
for secretion of "chimeric" antibodies. For example, the L6 mice, in addition to producing fully 
human IgM antibodies, produce antibodies having fully human heavy chain V, D, J regions coupled 
to mouse constant regions, such as a variety of gammas (i.e., mouse IgG1, 2, 3, 4) and the like. Such 
antibodies are highly useful in their own right. For example, human constant regions can be included 
on the antibodies through in vitro isotype switching techniques well known in the art. Alternatively, 
and/or in addition, fragments (i.e., F(ab) and F(ab')2 fragments) of such antibodies can be prepared 
which contain little or no mouse constant regions. 

As discussed above, the most critical factor to antibody production is specificity to a desired 
antigen or epitope on an antigen. Class of the antibody, thereafter, becomes important according to 
the therapeutic need. In other words, will the therapeutic index of an antibody be enhanced by 
providing a particular isotype or class? Consideration of that question raises issues of complement 
fixation and the like, which then drives the selection of the particular class or isotype of antibody. 
Gamma constant regions assist in affinity maturation of antibodies. However, the inclusion of a 
human gamma constant region on a transgene is not required to achieve such maturation. Rather, 
the process appears to proceed as well in connection with mouse gamma constant regions which are 
trans-switched onto the mu encoded transgene. 

Materials and methods 
The following Materials and Methods were utilized in connection with the generation and 
characterization of mice in accordance with the present invention. Such Materials and Methods are 
meant to be illustrative and are not limiting to the present invention. 

Cloning Human Ig-derived YACs: The Washington University (Brownstein et al., 1989) 
and the CEPH (Albertsen et al., 1990) human-YAC libraries were screened for YACs containing 
sequences from the human heavy and kappa light chain loci as previously described (Mendez et al., 
1995). Cloning and characterization of 1H and 1K YACs were described by Mendez et al., (1995). 
3H and 4H YACs were identified from the Washington University library using a VH3 probe (0.55 
kb PstI/NcoI, Herman et al., 1988). The 17H YAC was cloned from the GM1416 YAC library and 
determined to contain 130 kb of heavy chain variable sequences and a 150 kb chimeric region at its 
3' end (Matsuda et al., 1993). 2K and 3K YACs were recovered from the CEPH library using a 
VκII-specific primer (Albertsen et al., 1990). 

YAC targeting and recombination: Standard methods for yeast growth, mating, sporulation, 
and phenotype testing were employed (Sherman et al., 1986). Targeting of YACs and YAC vector 
arms with yeast and mammalian selectable markers, to facilitate the screening of YAC recombinants 
in yeast and of YAC integration into cells, was achieved by lithium acetate transformation (Schiestl 
and Gietz, 1989). After every targeting or recombination step the modified YAC(s) was analyzed by 
pulsed field gel electrophoresis and standard Southern Blots to determine the integrity of all sequences. 
YAC targeting vectors were used for the interconversion of centric and acentric arms to 
reorient 17H and to retrofit its 5' arm with LEU2 and URA3 genes and its 3' arm with the HIS3 gene. 
See Fig. 1a and Mendez et al., 1993. The 4H centric arm was retrofitted with the yeast ADE2 gene 
and the human HPRT selectable markers. For the first recombination step, a diploid yeast strain was 
created and selected in which all three YACs 17H, 3H, and 4H were present, intact, and stably 
maintained. A three-way homologous recombination between the YAC overlapping regions was 
induced by sporulation and the desired recombinant was found by the selection of the outer yeast 
selectable markers (ADE2 and HIS3) and negative selection (loss) of the internal marker URA3. The 
successful recombination created an 880 kb YAC containing 80% of the IgH variable region, starting 
at VH2-5 and extending 20 kb 5' of the VH3-65 gene. For the recombination of the 880 kb YAC to 
1H, 1H was retrofitted with pICL, which adds the LYS2 gene to the centric arm (Hermanson et al., 
1991). Using standard yeast mating, a diploid strain was selected containing both 1H and the 880 
kb YAC. Upon sporulation and by use of overlapping homology, YAC-yeast recombination was 
carried out. With positive selection for the outer yeast markers (ADE2 and URA3) and screening 
for the loss of the internal markers (TRP1, LYS2, HIS3), an intact 970 kb YAC consisting of 
approximately 66 VH segments, starting at VH6-1 and ending at VH3-65, was found. The YAC also 
contained the major D gene clusters, JH genes, the intronic enhancer (Eμ), Cμ, up to 25 kb past Cδ, 
in germline configuration. This 970 kb YAC was then retrofitted with a targeting vector including 
a 23 kb EcoRI genomic fragment of the human γ-2 gene, including its switch and regulatory 
elements, a 7 kb XbaI fragment of the murine heavy chain 3' enhancer, neomycin gene driven by the 
metallothionine promoter (MMTNeo), and the yeast LYS2 gene. This vector, while bringing in these 
sequences on the 3' YAC arm, disrupts the URA3 gene. 
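
The selection logic used at each recombination step, namely retention of the outer yeast markers together with loss of the internal markers, can be expressed as a simple set test. The sketch below applies it to the first recombination step described above (outer markers ADE2 and HIS3, internal marker URA3). The clone names and their marker phenotypes are hypothetical and serve only to illustrate the screen.

```python
from typing import Dict, List, Set


def is_desired_recombinant(present: Set[str], outer: Set[str], internal: Set[str]) -> bool:
    """True if all outer markers are retained and every internal marker has been lost."""
    return outer.issubset(present) and present.isdisjoint(internal)


# Marker sets for the first (three-way) recombination step, as described in the text.
STEP1_OUTER = {"ADE2", "HIS3"}
STEP1_INTERNAL = {"URA3"}

# Hypothetical clones and the selectable markers each one still carries.
candidates: Dict[str, Set[str]] = {
    "clone_a": {"ADE2", "HIS3"},          # outer markers kept, URA3 lost
    "clone_b": {"ADE2", "HIS3", "URA3"},  # internal marker still present
    "clone_c": {"ADE2"},                  # one outer marker missing
}

selected: List[str] = sorted(
    name
    for name, markers in candidates.items()
    if is_desired_recombinant(markers, STEP1_OUTER, STEP1_INTERNAL)
)
print(selected)  # ['clone_a']
```

The same test applies to the second step by substituting the outer markers ADE2 and URA3 and the internal markers TRP1, LYS2, and HIS3. Marker phenotype alone does not establish integrity, which is why each product was also checked by pulsed field gel electrophoresis and Southern blotting.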

As a first step toward creating yK2 YAC, by standard yeast mating a diploid yeast strain was 
selected in which retrofitted 1K and 3K YACs were both present, intact, and stably maintained. 
Using the same process as described in connection with the IgH construction, YAC-yeast 
recombination was carried out. Through use of positive selection for the outer yeast markers (LYS2, 
TRP1) and the screening for the loss of internal markers (URA3, TRP1), an intact 800 kb 
recombinant product was found which contained 32 Vκ genes starting at Vκ-B3 and ending at 
Vκ-Op11. The 800 kb YAC contains a deletion of approximately 100 kb starting at [...] and ending 
at [...]. However, the YAC is in germline configuration from Vκ-Lp-13 to 100 kb past Vκ-Op-1. The 
YAC also contains Jκ, the intronic and 3' enhancers, the constant Cκ, and Kde. 

YAC introduction into ES cells and mice: YAC-containing yeast spheroplasts were fused 
with E14.TG3B1 ES cells as described (Jakobovits et al., 1993a; Green et al., 1994). HAT-resistant 
colonies were expanded for analysis. YAC integrity was evaluated by Southern blot analysis using 
protocols and probes described in Herman et al. (1988) and Mendez et al. (1994) and hybridization 
conditions as described in Gemmill et al. (1991). Chimeric mice were generated by microinjection 
of ES cells into C57BL/6 blastocysts. YAC-containing offspring were identified by PCR analysis of 
tail DNA as described (Green et al., 1994). YAC integrity was evaluated by Southern blot analysis 
using probes and conditions previously described, except that the blot probed with human VH3 was 
washed at 50°C. 

Flow cytometry analysis: Peripheral blood and spleen lymphocytes obtained from 8-10 week 
old XenoMice and control mice were purified on Lympholyte M (Accurate), treated with purified 
anti-mouse CD32/CD16 Fc receptor (Pharmingen, 01241D) to block non-specific binding to Fc 
receptors, stained with antibodies, and analyzed on a FACStar PLUS (Becton Dickinson, CELLQuest 
software). Antibodies used: allophycocyanin (APC) anti-B220 (Pharmingen, 01129A); biotin 
anti-human IgM (Pharmingen, 08072D); biotin anti-mouse IgM (Pharmingen, 02202D); fluorescein 
isothiocyanate (FITC) goat F(ab')2 anti-human IgD (Southern Biotechnology, 2032-02); FITC 
anti-mouse IgDa (Pharmingen, 05064D); FITC anti-mouse IgDb (Pharmingen, 05074D); FITC anti-mouse 
λ (Pharmingen, 02174D); PE anti-human κ (Pharmingen, 08175A); PE anti-mouse κ (Pharmingen, 
02155A). RED613™-streptavidin (GibcoBRL, 19541-010) was used to detect biotinylated 
antibodies. 

Immunization and hybridoma generation: XenoMice (8 to 10 weeks old) were immunized 
intraperitoneally with 25 μg of recombinant human IL-8 or with 5 μg TNF-α (Biosource 
International) emulsified in complete Freund's adjuvant for the primary immunization and in 
incomplete Freund's adjuvant for the additional immunizations carried out at two week intervals. For 
EGFR immunization, XenoMice were immunized intraperitoneally with 2x10^7 A431 (ATCC 
CRL-7907) cells resuspended in phosphate buffered saline (PBS). This dose was repeated three 
times. Four days before fusion, the mice received a final injection of antigen or cells in PBS. Spleen 
and lymph node lymphocytes from immunized mice were fused with the non-secretory myeloma 
NSO-bcl2 line (Ray and Diamond, 1994), and were subjected to HAT selection as previously 
described (Galfre and Milstein, 1981). 

ELISA assay: ELISAs for determination of antigen-specific antibodies in mouse serum and 
in hybridoma supernatants were carried out as described (Coligan et al., 1994) using recombinant 
human IL-8 and TNF-α and affinity-purified EGFR from A431 cells (Sigma, E-3641) to capture the 
antibodies. The concentrations of human and mouse immunoglobulins were determined using the 
following capture antibodies: rabbit anti-human IgG (Southern Biotechnology, 6145-01), goat 
anti-human Igκ (Vector Laboratories, AJ-3060), mouse anti-human IgM (CGI/ATCC, HB-57), for 
human γ, κ, and μ Ig, respectively, and goat anti-mouse IgG (Caltag, M 30100), goat anti-mouse Igκ 
(Southern Biotechnology, 1050-01), goat anti-mouse IgM (Southern Biotechnology, 1020-01), and 
goat anti-mouse λ (Southern Biotechnology, 1060-01) to capture mouse γ, κ, μ, and λ Ig, 
respectively. The detection antibodies used in ELISA experiments were goat anti-mouse IgG-HRP 
(Caltag, M-30107), goat anti-mouse Igκ-HRP (Caltag, M 33007), mouse anti-human IgG2-HRP 
(Southern Biotechnology, 9070-05), mouse anti-human IgM-HRP (Southern Biotechnology, 
9020-05), and goat anti-human kappa-biotin (Vector, BA-3060). Standards used for quantitation of 
human and mouse Ig were: human IgG2 (Calbiochem, 400122), human IgMκ (Cappel, 13000), 
human IgG2κ (Calbiochem, 400122), mouse IgGκ (Cappel 55939), mouse IgMκ (Sigma, M-3795), 
and mouse IgG3λ (Sigma, M-9019). 

Determination of affinity constants of fully human Mabs by BIAcore: Affinity 
measurement of purified human monoclonal antibodies, Fab fragments, or hybridoma supernatants 
by plasmon resonance was carried out using the BIAcore 2000 instrument, using general procedures 
outlined by the manufacturers. 

Kinetic analysis of the antibodies was carried out using antigens immobilized onto the sensor 
surface at a low density: human IL-8, 81 RU; soluble EGFR purified from A431 cell membranes 
(Sigma, E-3641), 303 RU; and TNF-α, 107 RU (1,000 RU correspond to about 1 ng/mm^2 of 
immobilized protein). The dissociation (kd) and association (ka) rates were determined using the 
software provided by the manufacturers, BIAevaluation 2.1. 
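
By way of illustration only, the equilibrium dissociation constant follows from the two fitted rates as KD = kd/ka. The following is a minimal sketch, assuming kd is expressed in 1/s and ka in 1/(M*s); the function name and the example rates are hypothetical and are taken neither from the BIAevaluation software nor from the measurements reported herein.

```python
def dissociation_constant(kd_per_s: float, ka_per_M_s: float) -> float:
    """Return KD in M from a dissociation rate kd (1/s) and an association rate ka (1/(M*s))."""
    if kd_per_s < 0 or ka_per_M_s <= 0:
        raise ValueError("kd must be non-negative and ka must be positive")
    return kd_per_s / ka_per_M_s


# Hypothetical rates, chosen only to illustrate the arithmetic.
kd_example = 1.0e-4  # 1/s
ka_example = 1.0e6   # 1/(M*s)
print(dissociation_constant(kd_example, ka_example))
```

A lower KD indicates tighter binding, so that a slower dissociation rate or a faster association rate each improves the calculated affinity.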

Affinity measurement by radioimmunoassay: 125I-labeled human IL-8 (1.5 x 10^-11 M or 3 
x 10^-11 M) was incubated with purified anti-IL-8 human antibodies at varying concentrations (5 x 10^-13 
M to 4 x 10^-9 M) in 200 μl of PBS with 0.5% BSA. After 15 hrs. incubation at room temperature, 
20 μl of Protein A Sepharose CL-4B in PBS (1/1, v/v) was added to precipitate the antibody-antigen 
complex. After 2 hrs. incubation at 4°C, the antibody-125I-IL-8 complex bound to Protein A 
Sepharose was separated from free 125I-IL-8 by filtration using 96-well filtration plates (Millipore, 
Cat. No. MADVN65), collected into scintillation vials and counted. The concentration of bound and 
free antibodies was calculated and the binding affinity of the antibodies to the specific antigen was 
obtained using Scatchard analysis (2). 
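
For purposes of illustration, a Scatchard analysis of this kind can be sketched as a straight-line fit of bound/free against bound, in which the slope equals -1/Kd and the intercept equals Bmax/Kd. The sketch below assumes a single class of non-interacting binding sites and concentrations expressed in M; the function name and the synthetic data are hypothetical and do not reproduce the measurements or the software used herein.

```python
def scatchard_fit(bound, free):
    """Least-squares fit of bound/free versus bound; returns (Kd, Bmax)."""
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two matched bound/free measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]  # Scatchard ordinate: bound/free
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("bound values must not all be identical")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope       # slope = -1/Kd
    bmax = intercept * kd   # intercept = Bmax/Kd
    return kd, bmax


# Synthetic single-site data generated from assumed values, for illustration only.
true_kd, true_bmax = 1.0e-10, 1.0e-11
free_conc = [2.0e-11, 5.0e-11, 1.0e-10, 2.0e-10, 5.0e-10]
bound_conc = [true_bmax * f / (true_kd + f) for f in free_conc]
kd_fit, bmax_fit = scatchard_fit(bound_conc, free_conc)
print(kd_fit, bmax_fit)
```

With real data the points scatter about the line, and curvature in the plot would suggest that the single-site assumption does not hold.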

Receptor binding assays: The IL-8 receptor binding assay was carried out with human 
neutrophils prepared either from freshly drawn blood or from buffy coats as described 
(Lusti-Narasimhan et al., 1995). Varying concentrations of antibodies were incubated with 0.23 nM 
[125I]IL-8 (Amersham, IM-249) for 30 min at 4°C in 96-well Multiscreen filter plates (Millipore, 
MADV N6550) pretreated with PBS binding buffer containing 0.1% bovine serum albumin and 
0.02% NaN3 at 25°C for 2 hours. 4x10^5 neutrophils were added to each well, and the plates were 
incubated for 90 min at 4°C. Cells were washed 5 times with 200 μl of ice-cold PBS, which was 
removed by aspiration. The filters were air-dried, added to scintillation fluid, and counted in a 
scintillation counter. The percentage of specifically bound [125I]IL-8 was calculated as the mean cpm 
detected in the presence of antibody divided by cpm detected in the presence of buffer only. 
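
The calculation described above can be sketched as follows. This is a minimal illustration, assuming that replicate wells are averaged and that the ratio is multiplied by 100 to express it as a percentage; the function name and the counts are hypothetical and are not data from the assays described herein.

```python
from statistics import mean


def percent_bound(cpm_with_antibody, cpm_buffer_only):
    """Percentage of ligand still bound: mean cpm with antibody / cpm with buffer only."""
    if not cpm_with_antibody:
        raise ValueError("at least one replicate count is required")
    if cpm_buffer_only <= 0:
        raise ValueError("buffer-only cpm must be positive")
    return 100.0 * mean(cpm_with_antibody) / cpm_buffer_only


# Hypothetical replicate counts for one antibody concentration.
replicates = [500, 520, 480]
print(percent_bound(replicates, 1000))
```

Plotting this percentage against antibody concentration gives the inhibition curve from which blocking potency is read.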

Binding assays for TNF receptor were performed in a similar manner as the IL-8 assays 
described above. However, the human monocyte line U937 was utilized instead of the neutrophils 
used in connection with the IL-8 assays. Antibodies were preincubated with 0.25 nM [125I]TNF 
(Amersham, IM-206). 6x10^5 U937 cells were placed in each well. 

The EGF receptor binding assay was carried out with A431 cells (0.4 x 10^6 cells per well) 
which were incubated with varying concentrations of antibodies in PBS binding buffer for 30 minutes 
at 4°C. 0.1 nM [125I]EGF (Amersham, IM-196) was added to each well, and the plates were 
incubated for 90 min at 4°C. The plates were washed five times, air-dried and counted in a 
scintillation counter. Anti-EGFR mouse antibodies 225 and 528 (Calbiochem) were used as controls. 

Repertoire analysis of human Ig transcripts expressed in XenoMice and their derived 
human Mabs: Poly(A)+ mRNA was isolated from spleen and lymph nodes of unimmunized and 
immunized XenoMice using a Fast-Track kit (Invitrogen). The generation of random primed cDNA 
was followed by PCR. Human VH or human Vκ family-specific variable region primers (Marks et al., 
1991) or a universal human VH primer, MG-30 (CAGGTGCAGCTGGAGCAGTCIGG), were used 
in conjunction with primers specific for the human Cμ (hμP2) or Cκ (hκP2) constant regions as 
previously described (Green et al., 1994), or the human γ2 constant region, MG-40d: 
5'-GCTGAGGGAGTAGAGTCCTGAGGA-3'. PCR products were cloned into pCRII using a TA 
cloning kit (Invitrogen) and both strands were sequenced using Prism dye-terminator sequencing kits 
and an ABI 377 sequencing machine. Sequences of human Mab-derived heavy and kappa chain 
transcripts were obtained by direct sequencing of PCR products generated from poly(A)+ RNA using 
the primers described above. All sequences were analyzed by alignments to the "V BASE sequence 
directory" (Tomlinson et al., MRC Centre for Protein Engineering, Cambridge, UK) using MacVector 
and Geneworks software programs. 

Preparation and purification of antibody Fab fragments: Antibody Fab fragments were 
produced by using immobilized papain (Pierce). The Fab fragments were purified with a two step 
chromatographic scheme: HiTrap (Bio-Rad) Protein A column to capture Fc fragments and any 
undigested antibody, followed by elution of the Fab fragments retained in the flow-through on a strong 
cation exchange column (PerSeptive Biosystems), with a linear salt gradient to 0.5 M NaCl. Fab 
fragments were characterized by SDS-PAGE and MALDI-TOF MS under reducing and non-reducing 
conditions, demonstrating the expected ~50 kDa unreduced fragment and ~25 kDa reduced doublet. 
This result demonstrates the intact light chain and the cleaved heavy chain. MS under reducing 
conditions permitted the unambiguous identification of both the light and cleaved heavy chains since 
the light chain mass can be precisely determined by reducing the whole undigested antibody. 

EXAMPLES 

The following examples, including the experiments conducted and results achieved, are 
provided for illustrative purposes only and are not to be construed as limiting upon the present 
invention. 

Example 1: Reconstruction of human heavy chain loci on YACs 

In accordance with the present invention, the strategy that we utilized to reconstruct the 
human heavy chain and human kappa light chain variable regions was to, first, screen human-YAC 
libraries for YACs that spanned the large (megabase-sized) human Ig loci and, second, to recombine 
YACs spanning such regions into single YACs containing the desired loci predominantly in germline 
configuration. 

The above, stepwise, YAC recombination scheme exploited the high frequency of 
meiotic-induced homologous recombination in yeast and the ability to select the desired recombinants 
by the yeast markers present on the vector arms of the recombined YACs (See Figure 1, and Green 
et al., supra; see also Silverman et al., 1990 and den Dunnen et al., 1992). 

In connection with our strategy, we identified four YACs, 1H (240 kb), 2H (270 kb), 3H (300 
kb), and 4H (340 kb), which spanned about 830 kb, out of the about 1000 kb, of the human heavy 
chain variable region on chromosome 14q. YACs 1H, 2H, 3H, and 4H were used for reconstruction 
of the locus (See Figure 1A). Pulsed Field Gel Electrophoresis (PFGE) and Southern blot analysis 
confirmed the YACs to be in intact, germline configuration, with the exception of 150 kb at the 3' end 
of YAC 2H which contained certain non-IgH sequences (See Figure 1; Matsuda et al., 1990). YAC 
1H, the YAC that was previously introduced into our first generation XenoMouse™ (Green et al., 
supra; Mendez et al., 1995), is comprised of the human Cδ, Cμ, and DH regions and the first 5 VH 
genes in germline configuration. The other three YACs cover the majority of the VH region, from 
VH2-5 to VH3-65, thus contributing approximately an additional 61 different VH genes. Prior to 
recombination, YAC 4H was retrofitted with an HPRT selectable marker. Through utilization of the 
overlapping sequences contained on the YACs, the four YACs (1H, 2H, 3H, and 4H) were 
recombined in yeast by a stepwise recombination strategy (See Figure 1A). Such recombination 
strategy generated a 980 kb recombinant YAC (See Figure 1). Analysis of the YAC by PFGE and 
Southern blot analysis confirmed the presence of the human heavy chain locus from the Cμ region to 
20 kb 5' of the VH3-65 gene in germline configuration. No apparent deletions or rearrangements were 
observed. 

The YAC acentric arm was targeted with a vector bearing the complete human γ2 constant 
region, mouse 3' enhancer, and the neomycin resistance gene, to yield the final 1020 kb heavy chain 
YAC, yH2. YAC yH2 contained the majority of the human variable region, i.e., 66 out of the 82 VH 
genes, complete DH (32 genes) and JH (6 genes) regions, and three different constant regions (Cμ, 
Cδ, and Cγ) with their corresponding regulatory sequences (See Figure 1A). This was the heavy 
chain construct utilized for the production of our XenoMouse II strains. 

Example 2: Reconstruction of human kappa light chain loci on YACs 

A similar stepwise recombination strategy was utilized for reconstruction of the human kappa 
light chain locus. Three YACs were identified that spanned the human kappa loci. The YACs were 
designated 1K, 2K and 3K. YAC 1K, which had a length of approximately 180 kb, had previously 
been introduced into our first generation XenoMouse™. Such YAC contained the kappa deleting 
element (Kde), the kappa 3' and intronic enhancers, Cκ, Jκ, and the three Vκ genes on the B cluster 
(Green et al., 1994; Mendez et al., 1995). YAC 2K (approximately 480 kb) and 3K (approximately 
380 kb) together encompass most of the kappa chain proximal variable region on chromosome 2p. 
A deletion of approximately 100 kb spans the L13-L5 region (Fig. 1B; Huber et al., 1993). Inasmuch 
as the kappa distal region duplicates the proximal region, and as the proximal Vκ genes are the ones 
most commonly utilized in humans (Weichold et al., 1993; Cox et al., 1994), the proximal region was 
the focus of our reconstruction strategy (Fig. 1B). Through homologous recombination of the three 
YACs, an 800 kb recombinant YAC, yK2, was recovered. The size and integrity of the recombinant 
YAC was confirmed by PFGE and Southern blot analysis. Such analysis demonstrated that it covered 
the proximal part of the human kappa chain locus, with 32 Vκ genes in germline configuration except 
for the described deletion in the Lp region (Fig. 1B). yK2 centric and acentric arms were modified 
to contain the HPRT and neomycin selectable markers, respectively, as described (Materials and 
Methods). This was the kappa light chain construct utilized for the production of our XenoMouse 
II strains. 

The YACs described herein, yH2 and yK2, represent the first megabase-sized reconstructed 
human Ig loci to contain the majority of the human antibody repertoire, predominantly in germline 
configuration. This accomplishment further confirmed homologous recombination in yeast as a 
powerful approach for successful reconstruction of large, complex, and unstable loci. The selection 
of stable YAC recombinants containing large portions of the Ig loci in yeast provided us with the 
human Ig fragments required to equip the mice with the human antibody repertoire, constant regions, 
and regulatory elements needed to reproduce human antibody response in mice. 

Example 3: Introduction of yH2 and yK2 YACs into ES cells 

In accordance with our strategy, we introduced the YACs, yH2 and yK2, into mouse 
embryonic stem (ES) cells. Once ES cells containing the YAC DNA were isolated, such ES cells 
were utilized for the generation of mice through appropriate breeding. 

In this experiment, therefore, YACs yH2 and yK2 were introduced into ES cells via fusion 
of YAC-containing yeast spheroplasts with HPRT-deficient E14.TG3B1 mouse ES cells as previously 
described (Jakobovits et al., 1993a; Green et al., 1994). HPRT-positive ES cell clones were selected 
at a frequency of 1 clone/15-20x10^6 fused cells and were analyzed for YAC integrity by Southern and 
CHEF blot analyses (Fig. 2A). 

Seven of thirty-five ES cell clones (referred to as L10, J9.2, L17, L18, J17, L22, L23) derived 
from ES cell fusion with yH2-containing yeast were found to contain all expected EcoRI and BamHI 
yH2 fragments detected by probes spanning the entire insert: mouse 3' enhancer, human intronic 
enhancer, human Cγ2, Cδ, and Cμ constant regions, DH, JH and all the different VH families: VH1, VH2, 
VH3, VH4, VH5, and VH6 (data shown for 5 clones in Fig. 2A). CHEF analysis further confirmed that 
these clones, which represent 20% of all clones analyzed, contain the entire intact yH2 YAC with no 
apparent deletions or rearrangements (data not shown). 

ES cell clones derived from the fusion of yK2-containing yeast were similarly analyzed for 
YAC integrity, using probes specific for the human Kde, kappa 3' and intronic enhancers, Jκ, and 
all of the different Vκ families: VκI, VκII, VκIII, VκIV, VκVI. Twenty clones of the sixty clones had 
intact and unaltered YAC, which represent 30% of total clones analyzed (data shown for two ES 
clones in Fig. 3A). Varying amounts of yeast genomic sequences were detected in yH2- and yK2-ES 
cell clones (data not shown). 

These results are the first demonstration of introduction of megabase-sized constructs 
encompassing reconstructed human loci, predominantly in germline configuration, into mammalian 
cells. The relatively high frequency of intact YACs integrated into the mouse genome further 
validated the ES cell-yeast spheroplast fusion methodology as an effective approach for faithful 
introduction of large human genomic fragments into ES cells. 

Example 4: Generation of XenoMouse II strains 

In order to generate mice from the YAC DNA-containing ES cells, microinjection of 
blastocysts was conducted, followed by breeding. Thus, yH2- and yK2-bearing ES cell clones were 
expanded and microinjected into mouse C57BL/6J blastocysts (Green et al., 1994) and the chimeric 
males produced were evaluated for germline transmission. Offspring with transmitted YAC were 
identified by PCR analysis and the YAC integrity was confirmed by Southern blot analysis. In all 
transgenic mice analyzed the YAC was shown to be in intact form (Figs. 2B, 3B). All seven 
microinjected yH2-ES clones and two out of eight yK2-ES clones were transmitted through the 
mouse germline. 

In order to generate mice that produced human antibodies to the exclusion of endogenous 
antibodies, yH2- or yK2-transgenic mice were bred with double-inactivated (DI) mouse strains. The 
DI mouse strains are homozygous for gene targeted-inactivated mouse heavy and kappa chain loci 
and thus are deficient in antibody production (Jakobovits et al., 1993b; Green et al., 1994). Two of 
the yH2-transgenic mouse strains, L10 and J9.2, and one of the yK2-transgenic mouse strains, J23.1, 
were bred with DI mice to generate mice bearing YACs on a homozygous inactivated mouse heavy 
and kappa chain background (yH2;DI, and yK2;DI). Each of the yH2;DI transgenic strains was bred 
with the yK2;DI transgenic strain to generate two XenoMouse II strains, 2A-1 (L10;J23.1;DI) and 
2A-2 (J9.2;J23.1;DI), respectively, containing both heavy and light chain YACs on homozygous DI 
background. L10 is fully homozygous and J9.2 and J23.1 are in the process of being successfully 
bred to homozygosity. 

The integrity of the human heavy and kappa chain YACs in XenoMouse II strains was 
confirmed by Southern blot analysis. As shown in Fig. 2 and Fig. 3, in both XenoMouse strains 
analyzed, yH2 and yK2 were transmitted unaltered through multiple generations with no apparent 
deletions or rearrangements. 

Example 5: B-cell development and human antibody production by XenoMouse II mice 

In order to further characterize the XenoMouse II strains, we studied their B-cell development 
and their production of human antibodies. Reconstitution of B-cell development and antibody 
production in XenoMouse II strains by yH2 and yK2 YACs was evaluated by flow cytometry and 
ELISA. In contrast to DI mice, which completely lack mature B-cells, XenoMouse II manifested 
essentially normal B-cell development with the mature B-cell population in the blood totaling over 
50% of the level seen in wild type mice (Fig. 4A). All B-cells were shown to express human IgM and 
high levels of B220 (human IgM+/B220hi), with 60% of this population also expressing human IgD. 
Similar results were obtained from analysis of XenoMouse spleen and lymph nodes (not shown). 
These results correlate well with the characteristics of mature B-cells in wild type mice, indicating 
proper B-cell maturation in XenoMouse. 

The majority of XenoMouse B-cells (75-80%) expressed exclusively human kappa (κ) light 
chain, whereas only about 15% expressed mouse lambda (λ) light chain (Fig. 4). This light chain 
distribution ratio (hκ/mλ: 75:15) is comparable to that observed in wild type mice, indicating a 
mouse-like regulation of light chain utilization. In contrast, XenoMouse I, as described in Green et 
al., 1994, showed a ratio of hκ/mλ of 55:45 (data not shown). Similar observations were made for 
B-cells from spleen (Fig. 4B) and lymph nodes (not shown), indicating that most of XenoMouse II's 
B-cells produced exclusively fully human antibodies. Levels of mλ-expressing B-cells were reduced 
from 15% to 7% in XenoMouse II strains homozygous for yK2 (data not shown). 

Example 6: Generation of L6 Strain 

The L6 strain of mice was generated identically to the process described above in connection 
with the generation of the XenoMouse II strains. However, owing to a deletion event during the 
generation of the L6 ES cell line, the ES cell line and, subsequently, the L6 mouse evolved without 
a portion of the sequence distal to Cδ, thus eliminating the Cγ constant region and its regulatory 
sequences. Following completion of breeding, the L6 mice will contain the entire yK2 construct and 
the entire yH2 construct, except for the missing Cγ constant region. 

Example 7: Human Antibody Production 

Expression of human Cμ, Cγ2, and κ light chains was detected in unimmunized XenoMouse 
II sera at maximal levels of 700, 600, and 800 μg/ml, respectively. To determine how these values 
compared to wild-type, we measured maximal levels of mouse Cμ, Cγ2, and κ light chains in 
C57BL/6J x 129 mice kept under similar pathogen-free conditions. The values for Cμ, Cγ2, and κ 
light chain in wild-type mice were 400, 2000, and 2000 μg/ml, respectively. Upon immunization, the 
human γ chain levels increased to approximately 2.5 mg/ml. The concentration of mouse κ was only 
70 μg/ml, further confirming the preferential use of human kappa chain. 

These findings confirmed the ability of the introduced human Ig YACs to induce proper Ig 
gene rearrangement and class switching and to generate significant levels of fully human IgM and IgG 
antibodies before and after immunization. 

Example 8: A diverse human antibody repertoire in XenoMouse II 

In order to further understand the reconstitution of the antibody repertoire in XenoMouse II 
strains, we challenged mice with several antigens, and prepared hybridoma cell lines secreting 
antibodies against them. As will be understood, recapitulation of the human antibody response in mice requires 
diverse utilization of the different human variable genes contained on yH2 and yK2 YACs. The 
diversity of the human antibodies generated by XenoMouse II strains was determined by cloning and 
sequencing human heavy chain (μ and γ) and kappa light chain transcripts from XenoMouse lymph 
nodes. Based upon our data to date, sequence analysis demonstrates that XenoMouse II utilizes at 
least 11 out of the 37 functional VH genes present on yH2, eight different DH segments and three JH 
genes (JH3, JH4, JH6) (Table III; JH5 was also detected in connection with our sequencing of antibodies 
from hybridomas). V-D-J sequences were linked to human μ or γ2 constant regions (not shown). 

The VH genes utilized are widely distributed over the entire variable region and represent four 
out of the seven VH families (Table III). The predominant utilization of V genes from the VH3 and VH4 
families is similar to the VH usage pattern in adult humans, which is proportional to family size 
(Yamada et al., 1991; Brezinschek et al., 1995). The predominant usage of JH4 is also reminiscent of 
that detected in human B-cells (Brezinschek et al., 1995). Addition of non-germline nucleotides 
(N-additions) at both V-D and D-J joinings, ranging from 1-12 bp, was also observed. Such 
N-additions produced complementarity determining regions 3 (CDR3s) with lengths of from 8 to about 
19 amino acid residues, which is very comparable to that observed in adult human B-cells (Yamada 
et al., 1991; Brezinschek et al., 1995). Such CDR3 lengths observed in the XenoMouse II are much 
longer than CDR3 lengths ordinarily observed in mice (Feeney, 1990). 

A highly diverse repertoire was also found in the ten kappa chain transcripts sequenced. In 
addition to displaying 8 out of the 25 Vκ functional open reading frames (ORFs) present on yK2, all 
of the Jκ genes were detectable (Table IV). The different Vκ genes utilized were widely dispersed 
throughout yK2, representing all four major Vκ gene families. All VκJκ recombination products 
were linked properly to Cκ sequences. The paucity of N-additions in our transcripts is in agreement 
with the greatly reduced terminal deoxynucleotide transferase activity at the stage of kappa chain 
rearrangement. The average CDR3 length of 9-10 amino acids that we observed in the kappa chain 
transcripts is identical to that observed in human B-cells (Marks et al., 1991). 

In Tables III and IV below, repertoire analyses of human heavy and kappa light chain 
transcripts expressed in XenoMouse II strains are presented. Human μ, γ, and κ specific mRNAs 
were amplified by PCR, cloned and analyzed by sequencing as described in Materials and Methods. 
Table III shows a series of nucleotide sequences of 12 unique human heavy chain clones, divided into 
VH, D, JH and N segments, as identified by homology with published germline sequences (Materials 
and Methods). Each D segment assignment is based on at least 8 bases of homology. Table IV 
shows a series of nucleotide sequences of V-J junctions of 8 independent human κ clones. The 
sequences are divided into Vκ, Jκ, and N segments and identified based on homology to published Vκ 
and Jκ sequences. In each of the Tables, N-additions and deletions (indicated as _) were determined 
by their lack of sequence homology to V, D, or J sequences. 



TABLE III 

Repertoire Analysis of Human Heavy Chain Transcripts 

Clone    VH                              N                D                             N                  JH
A2.2.1   5-51 (DP-73)   TTACTGTCCGAGACA  4 (TAGC)         XP5rc      AATCAT             12 (GCGAGCTACGGG)  JH4  GACTACTGGGGC
B2.1.5   3-33 (DP-50)   TTACTGTGCGAGAGA  7 (TCGGGGA)      3rc        AATAGCA            7 (CTGGCCT)        JH4  _CTTTGACTACTGGGGC
B4.2.4   3-15 (DP-38)   TTACTGTACCACAGA  1 (G)            K1         GGCTAC             11 (ACTAACTACCC)   JH6  CTACTACTACTACGGT
B4.2.5   4-59 (DP-71)   TTACTGTGCGAGAGA  10 (TAGGAGTGTT)  4          GTAGTACCAGCTGCTAT  6 (ACCCAA)         JH6  ACTACTACTACTACGGT
D2.2.5   4-34 (DP-63)   TTACTGTGCGAGAG_  2 (GG)           N1         GCAGCAGCTG         4 (CCCT)           JH4  _CTTTGACTACTGGGGC
D2.1.3   3-48 (DP-31)   TTACTGTGCGAGAGA  4 (TCTT)         XP1        GATATTTTGACTGGT    2 (CT)             JH6  CTACTACTACTACGGT
D2^.8    4-31 (DP-65)   TTACTGTGCGAGAGA  2 (GA)           A4         GACTGCAG           5 (CGGTT)          JH4  TTTGACTACTGGGGC
A2.2.4   3-21 (DP-77)   TTACTGTGCGAGAGA  2 (TT)           IR3        GGGGCTGG           3 (ACC)            JH6  _TACTACTACTACTACGGT
D4.2.11  4-4/4-35       ATTACTGTGCGA     1 (A)            N1         TATAGCAGTGGCTGGT   2 (GT)             JH4  CTTTGACTACTGGGGC
C1.2.1   1-18 (DP-14)   TATTACTGTGCGAG_  0                XP'1/21-7  GTTA               0                  JH4  GACTACTGGGGC
C3.1.2   4-39 (DP-79)   TATTACTGTGCG     3 (GCC)          2          GGATATAGTAGTGG     6 (TCGGGC)         JH4  CTTTGACTACTGGGGC
D2.2.7   5-51 (DP-73)   TTACTGTGCGAGACA  4 (TGGC)         K1         AGTGGCT            9 (GGTACTCTG)      JH3  ATGCTTTGATATCTGGGG

TABLE IV 

Repertoire Analysis of Human Kappa Light Chain Transcripts 

Clone    Vκ                                   N       Jκ
F2.2.3   O2 (DPK9)     TTAAACGAACAGTACCCC     0       Jκ5  GATCACCTTCGGCCAA
F4.1.g   L5 (DPK3)     ACAGGCTAACAGTTTCCCTC_  0       Jκ1  _GGACGTTCGGCCAA
F4.1.6   A20 (DPK4)    AAGTATAACAGTGCCCC      0       Jκ3  ATTCACTTTCGGCCCT
F2.2.5   O8            ACAGTATGATAATCTCCC     0       Jκ4  GCTCACTTTCGGCGGA
F2.1.5   L1            AAAGTATAATAGTTACCC     0       Jκ5  GATCACCTTCGGCCAA
F2.1.4   A30           CAGCATAATAGTTACCC      0       Jκ3  ATTCACTTTCGGCCCT
F2.1.3   B3 (DPK24)    AATATTATAGTACTCC       0       Jκ4  GCTCACTTTCGGCGGA
F4.1.3   A27 (DPK22)   CAGTATGGTAGCTCACCTC    1 (G)   Jκ2  CACTTTTGGCCAG



These results, together with sequences of XenoMouse-derived hybridomas described later, 
demonstrate a highly diverse, adult human-like utilization of V, D, and J genes, which appears to 
demonstrate that the entire human heavy and kappa chain variable regions present on the yH2 and 
the yK2 YACs are accessible to the mouse system for antibody rearrangement and are being utilized 
in a non-position-biased manner. In addition, the average length of N-additions and CDR3s for both 
the heavy and kappa chain transcripts is very similar to that seen in adult human B-cells, indicating 
that the YAC DNA contained in the mice directs the mouse machinery to produce an adult human-like 
immune repertoire in mice. 
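
By way of illustration, the gene usage summarized above can be tallied directly from the VH and JH assignments of the twelve heavy chain clones listed in Table III. The sketch below is not part of the original analysis; the variable and function names are hypothetical, and the gene assignments are transcribed from the table as printed.

```python
from collections import Counter

# (VH gene, JH gene) assignments of the twelve heavy chain clones in Table III.
TABLE_III = [
    ("5-51", "JH4"), ("3-33", "JH4"), ("3-15", "JH6"), ("4-59", "JH6"),
    ("4-34", "JH4"), ("3-48", "JH6"), ("4-31", "JH4"), ("3-21", "JH6"),
    ("4-4/4-35", "JH4"), ("1-18", "JH4"), ("4-39", "JH4"), ("5-51", "JH3"),
]


def vh_family(vh_gene: str) -> str:
    """Return the VH family (e.g. 'VH3') from a gene name such as '3-33'."""
    return "VH" + vh_gene.split("-")[0]


family_counts = Counter(vh_family(vh) for vh, _ in TABLE_III)
jh_counts = Counter(jh for _, jh in TABLE_III)
distinct_vh = {vh for vh, _ in TABLE_III}

print(sorted(family_counts.items()))
print(sorted(jh_counts.items()))
print(len(distinct_vh))
```

The tally gives four VH families, with VH3 and VH4 predominating, JH4 as the most frequent J segment, and 11 distinct VH genes, in keeping with the usage described in the text.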

In connection with the following Examples, we prepared high affinity antibodies to several 
antigens. In particular, antibodies were prepared to human IL-8 and human EGFR. The rationale for 
the selection of IL-8 and EGFR is as follows. 

IL-8 is a member of the C-X-C chemokine family. IL-8 acts as the primary chemoattractant 
for neutrophils implicated in many diseases, including ARDS, rheumatoid arthritis, inflammatory 
bowel disease, glomerulonephritis, psoriasis, alcoholic hepatitis, and reperfusion injury, to name a few. 
Moreover, IL-8 is a potent angiogenic factor for endothelial cells. In Figures 22-28, we demonstrate 
that human anti-IL-8 antibodies derived from XenoMouse II strains are effective in inhibiting IL-8's 
actions in a number of pathways. For example, Figure 22 shows blockage of IL-8 binding to human 
neutrophils by human anti-IL-8. Figure 23 shows inhibition of CD11b expression on human 
neutrophils by human anti-IL-8. Figure 24 shows inhibition of IL-8 induced calcium influx by human 
anti-IL-8 antibodies. Figure 25 shows inhibition of IL-8 RB/293 chemotaxis by human anti-IL-8 
antibodies. Figure 26 is a schematic diagram of a rabbit model of human IL-8 induced skin 
inflammation. Figure 27 shows the inhibition of human IL-8 induced skin inflammation in the rabbit 
model of Figure 26 with human anti-IL-8 antibodies. Figure 28 shows inhibition of angiogenesis of 
endothelial cells on a rat corneal pocket model by human anti-IL-8 antibodies. 

EGFR is viewed as an anti-cancer target. For example, EGFR is overexpressed, up to 100 
fold, on a variety of cancer cells. Ligand (EGF and TGF) mediated growth stimulation plays a critical 
role in the initiation and progression of certain tumors. In this regard, EGFR antibodies inhibit ligand 
binding and lead to the arrest of tumor cell growth, and, in conjunction with chemotherapeutic agents, 
induce apoptosis. Indeed, it has been demonstrated that a combination of EGFR Mabs resulted in 
tumor eradication in murine xenogeneic tumor models. Imclone has conducted Phase I clinical 
trials utilizing a chimeric Mab (C225) that proved to be safe. In Figures 31-33, we demonstrate data 
related to our human anti-EGFR antibodies. Figure 30 shows heavy chain amino acid sequences of 
human anti-EGFR antibodies derived from XenoMouse II strains. Figure 31 shows blockage of EGF 
binding to A431 cells by human anti-EGFR antibodies. Figure 32 shows inhibition of EGF binding 
to SW948 cells by human anti-EGFR antibodies. Figure 33 shows that human anti-EGFR antibodies 
derived from XenoMouse II strains inhibit growth of SW948 cells in vitro. 

Example 9: High affinity, antigen-specific human Mabs produced by XenoMouse II 

We next asked whether the demonstrated utilization of the large human repertoire in 
XenoMouse II could be harnessed to generate human antibodies to multiple antigens, in particular, 
human antigens of significant clinical interest. 

Accordingly, individual XenoMouse II pups were each challenged with one of three different 
antigen targets: human IL-8, human EGFR, and human TNF-α. Antigens were administered in two 
different forms, either as soluble protein, in the case of IL-8 and TNF-α, or expressed on the surface 
of cells (A431 cells), in the case of EGFR. For all three antigens, ELISAs performed on sera from 
immunized mice indicated a strong antigen-specific human antibody (IgG, Igκ) response with titers 
as high as 1:3x10^6. Negligible mouse λ response was detected. 

Hybridomas were derived from spleen or lymph node tissues by standard hybridoma 
technology and were screened for secretion of antigen-specific human Mabs by ELISA. 

An IL-8 immunized XenoMouse II yielded a panel of 12 hybridomas, all secreting fully human 
(hIgG2κ) Mabs specific to human IL-8. Antibodies from four of these hybridomas, D1.1, K2.2, K4.2, 
and K4.3, were purified from ascitic fluid and evaluated for their affinity for human IL-8 and their 
potency in blocking binding of IL-8 to its receptors on human neutrophils. 

Affinity measurements were performed by solid phase measurements of both whole antibody 
and Fab fragments using surface plasmon resonance in BIAcore and in solution by radioimmunoassay 
(Materials and Methods). As shown in Table V, affinity values measured for the four Mabs ranged 
from 1.1x10^9 to 4.8x10^10 M^-1. While there was some variation in the techniques employed, affinity 
values for all four antibodies were consistently higher than 10^9 M^-1. 

ELISA analysis confirmed that these four antibodies were specific to human IL-8 and did not 
cross-react with the closely related chemokines MIP-1α, GROα, β, and γ, ENA-78, MCP-1, or 
RANTES (data not shown). Further, competition analysis on the BIAcore indicated that the 
antibodies recognize at least two different epitopes (data not shown). All antibodies inhibit IL-8 
binding to human neutrophils as effectively as the murine anti-human IL-8 neutralizing antibody, 
whereas a control human IgG2κ antibody did not (Fig. 5A). 

Fusion experiments with EGFR-immunized Xenomouse II yielded a panel of 25 hybridomas, 
all secreting EGFR-specific human IgG2κ Mabs. Of the thirteen human Mabs analyzed, four (E1.1, 
E2.4, E2.5, E2.11) were selected for their ability to compete with EGFR-specific mouse antibody 
225, which has previously been shown to inhibit EGF-mediated cell proliferation and tumor formation 
in mice (Sato et al., 1983). These human antibodies, purified from ascitic fluid, were evaluated for 
their affinity for EGFR and neutralization of EGF binding to cells. The affinities of these antibodies 
for EGFR, as determined by BIAcore measurements, ranged from 2.9x10^9 to 2.9x10^10 M^-1 (Table V). 
All four anti-EGFR antibodies completely blocked EGF binding to A431 cells (Fig. 5B), 
demonstrating their ability to neutralize its binding to both high and low affinity receptors on these 
cells (Kawamoto et al., 1983). Complete inhibition of EGF binding to EGFR expressed on human 
SW948 lung carcinoma cells by all four anti-EGFR human antibodies was also observed (data 
not shown). In both cases, the fully human antibodies were as effective in inhibition of EGF binding 
as the anti-EGFR mouse antibody 225 and more potent than the 528 antibody (Gill et al., 1983). In 
both cell assays, a control human IgG2κ antibody did not affect EGF binding (Fig. 5B and data not 
shown). 

Fusion experiments with TNF-α immunized Xenomouse II yielded a panel of 12 human IgG2κ 
antibodies. Four out of the 12 were selected for their ability to block the binding of TNF-α to its 
receptors on U937 cells (Fig. 5C). The affinities of these antibodies were determined to be in the 
range of 1.2-3.9x10^9 M^-1 (Table V). 

The described Xenomouse-derived hybridomas produced antibodies at concentrations in the 
range of 2-19 ng/ml in static culture conditions. Characterization of the purified antibodies on protein 
gels under non-reducing conditions revealed the expected apparent molecular weight of 150 kD for 
the IgG2κ antibody. Under reducing conditions the expected apparent molecular weights of 50 kD 
for the heavy and 25 kD for the light chain were detected (data not shown). 

Table V, below, shows affinity constants of XenoMouse-derived antigen-specific fully human 
Mabs. The affinity constants of XenoMouse-derived human IgG2κ Mabs specific to IL-8, EGFR, and 
TNF-α were determined by BIAcore or by radioimmunoassay as described in Materials and Methods. 
The values shown for IL-8 and EGFR are representative of independent experiments carried out with 
purified antibodies, while the values shown for TNF-α are from experiments carried out with 
hybridoma supernatants. 

TABLE V 

                                 Solid Phase Measurements                                   Solution
Human Mab            ka            kd            KA            KD             Surface       Radio-
IgG2κ      Antigen   (M^-1 s^-1)   (s^-1)        (M^-1)        (M)            Density [RU]  immunoassay (M^-1)

D1.1       IL-8      2.7 x 10^6    9.9 x 10^-4   2.7 x 10^9    3.7 x 10^-10   81            2.0 x 10^10
D1.1 Fab   IL-8      2.1 x 10^6    2.1 x 10^-3   1.1 x 10^9    8.8 x 10^-10   81            4.9 x 10^10
K2.2       IL-8      0.9 x 10^6    2.3 x 10^-4   4.0 x 10^9    2.5 x 10^-10   81            1.0 x 10^10
K4.2       IL-8      2.5 x 10^6    4.1 x 10^-4   6.3 x 10^9    1.6 x 10^-10   81            ND
K4.3       IL-8      4.3 x 10^6    9.4 x 10^-4   4.5 x 10^9    2.2 x 10^-10   81            2.1 x 10^10
K4.3 Fab   IL-8      6.0 x 10^6    2.1 x 10^-3   2.9 x 10^9    3.4 x 10^-10   81

                                                                                            ELISA (M)
E1.1       EGFR      1.9 x 10^6    6.5 x 10^-4   2.9 x 10^9    3.46 x 10^-10  303           1.1 x 10^-10
E2.5       EGFR      2.1 x 10^6    1.8 x 10^-4   1.2 x 10^10   8.44 x 10^-11  303           3.6 x 10^-10
E2.11      EGFR      1.7 x 10^6    4.7 x 10^-4   3.7 x 10^9    2.68 x 10^-10  303           1.1 x 10^-10
E2.4       EGFR      2.8 x 10^6    9.78 x 10^-5  2.9 x 10^10   3.5 x 10^-11   818           1.1 x 10^-10

T22.1      TNF-α     1.6 x 10^6    1.3 x 10^-3   1.2 x 10^9    8.06 x 10^-10  107
T22.4      TNF-α     2.4 x 10^6    4.6 x 10^-4   5.3 x 10^9    1.89 x 10^-10  107
T22.8      TNF-α     1.7 x 10^6    7.5 x 10^-4   2.3 x 10^9    4.3 x 10^-10   107
T22.9      TNF-α     2.3 x 10^6    4.9 x 10^-4   4.8 x 10^9    2.11 x 10^-10  107
T22.11     TNF-α     2.9 x 10^6    7.9 x 10^-4   N/A           2.76 x 10^-10  107
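
The relationship between the kinetic rate constants and the equilibrium constants of the kind tabulated in Table V can be sketched as follows. This is a minimal illustration only, assuming a simple 1:1 binding model in which KA = ka/kd and KD = kd/ka; the function name and the numerical inputs are hypothetical and are not taken from the measurements reported herein.

```python
import math


def affinity_constants(ka, kd):
    """Derive equilibrium constants from kinetic rate constants.

    ka: association rate constant in M^-1 s^-1
    kd: dissociation rate constant in s^-1
    Returns (KA, KD): association constant in M^-1 and
    dissociation constant in M, assuming 1:1 binding.
    """
    if ka <= 0 or kd <= 0:
        raise ValueError("rate constants must be positive")
    KA = ka / kd  # equilibrium association constant
    KD = kd / ka  # equilibrium dissociation constant, reciprocal of KA
    return KA, KD


# Hypothetical inputs chosen only to illustrate the arithmetic.
KA, KD = affinity_constants(ka=2.0e6, kd=1.0e-3)
print(f"KA = {KA:.1e} M^-1, KD = {KD:.1e} M")

# The two equilibrium constants are reciprocals of one another.
assert math.isclose(KA * KD, 1.0)
```

Under this assumption, a slower dissociation rate at a comparable association rate translates directly into a higher affinity constant.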





Example 10: Gene usage and somatic hypermutation in monoclonal antibodies 

The sequences of the heavy and kappa light chain transcripts from the described IL-8- and 
EGFR-specific human Mabs were determined (Figure 6 and Figures [[ ]]). The four IL-8-specific antibodies 
consisted of at least three different VH genes (VH4-34/VH4-21, VH3-30, and VH5-51), four different DH 
segments (A1/A4, K1, ir3rc, and 21-10rc) and two JH (JH3 and JH4) gene segments. Three different 
Vκ genes (O12, O18, and B3) combined with Jκ3 and Jκ4 genes. Such diverse utilization shows that 
Xenomouse II is capable of producing a panel of anti-IL-8 neutralizing antibodies with diverse 
variable regions. 

In contrast to the IL-8 antibody transcripts, the sequences of antibodies selected for their 
ability to compete with Mab 225 showed relatively restricted VH and Vκ gene usage, with three 
antibodies, E1.1, E2.4 and E2.5, sharing the same VH gene (4-31) and E2.11 containing VH4-61, which 
is highly homologous to VH4-31. Different D (2, A1/A4, XP1) and JH (JH3, JH4, JH5) segments were 
detected. All four antibodies were shown to share the same Vκ (O18) gene. Three of them 
contained Jκ4, and one, E2.5, contained Jκ2. 

Most Vh and Vk hybridoma transcripts showed extensive nucleotide changes (7-17) from the 
corresponding germline segments, whereas no mutations were detected in the constant regions. Most 
of the mutations in V segments resulted in amino acid substitutions in the predicted antibody amino 
acid sequences (0-12 per V gene), many in CDR1 and CDR2 regions (Figure J). Of note are the 
mutations which are shared by the heavy chain sequences of EGFR antibodies, such as the Gly→Asp 
substitution in CDR1, shared by all antibodies, or the Ser→Asn substitution in CDR2 and Val→Leu in the 
framework region 3 shared by three antibodies. These results indicated that an extensive process of 
somatic hypermutation, leading to antibody maturation and selection, is occurring in Xenomouse II. 
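
The comparison of a hybridoma transcript against its germline segment, as described above, can be sketched in a few lines. This is an illustrative sketch only, assuming the two sequences are already aligned, in frame, of equal length and free of insertions or deletions; the function name and the short example sequences are hypothetical and are not the sequences of the antibodies described herein.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def count_changes(germline, transcript):
    """Return (nucleotide changes, amino acid substitutions).

    Both inputs are DNA strings of equal length, a multiple of three,
    aligned codon by codon (an assumption of this sketch).
    """
    germline, transcript = germline.upper(), transcript.upper()
    if len(germline) != len(transcript) or len(germline) % 3:
        raise ValueError("sequences must be aligned, in frame, equal length")
    # Count positions at which the transcript differs from germline.
    nt_changes = sum(g != t for g, t in zip(germline, transcript))
    # Count codons whose encoded amino acid differs (replacement mutations).
    aa_changes = sum(
        CODON_TABLE[germline[i:i + 3]] != CODON_TABLE[transcript[i:i + 3]]
        for i in range(0, len(germline), 3)
    )
    return nt_changes, aa_changes


# Hypothetical example: Gly->Asp, Ser->Asn, Val->Leu and one silent change.
print(count_changes("GGTAGCGTTGCA", "GATAACCTTGCG"))
```

Separating replacement from silent changes in this way mirrors the distinction drawn in the text between total nucleotide changes and resulting amino acid substitutions.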



The present application describes the first functional substitution of complex, megabase-sized 
mouse loci with human DNA fragments equivalent in size and content, reconstructed on YACs. With 
this approach, the mouse humoral immune system was "humanized" with megabase-sized human Ig 
loci to substantially reproduce the human antibody response in mice deficient in endogenous antibody 
production. 

Our success in faithful reconstruction of a large portion of the human heavy and kappa light 
chain loci, nearly in germline configuration, establishes YAC recombination in yeast as a powerful 
technology to reconstitute large, complex and unstable fragments, such as the Ig loci (Mendez et al., 
1995), and manipulate them for introduction into mammalian cells. Furthermore, the successful 
introduction of the two large heavy and kappa light chain segments into the mouse germline in intact 
form confirms the methodology of ES cell-yeast spheroplast fusion as a reliable and efficient approach 
to delivering xenogeneic loci into the mouse germline. 

Characterization of Xenomouse II strains has shown that the large Ig loci were capable of 
restoring the antibody system, comparable in its diversity and functionality to that of wildtype mice, 
and much superior to the humoral response produced in mice bearing human Ig minigene constructs 
(Lonberg et al., 1994) or small human Ig YACs (Green et al., 1994). This difference was manifested 
in the levels of mature B-cells, human Ig production, class switching efficiency, diversity, 
preponderance of human Igκ over murine Igλ production, magnitude of the human antibody 
response, and success in the generation of high affinity, antigen-specific monoclonal antibodies to 
multiple antigens. 

The levels of mature B-cells and human antibodies in Xenomouse II are the highest yet 
reported for Ig-transgenic mice, representing a several-fold increase over the levels shown for 
previous mice and approaching those of wildtype mice. In particular, the levels of the human IgG 
were more than 100 fold higher than those reported for mice bearing minilocus Ig transgenes with 
human γ1 gene (Lonberg et al., 1994). The more efficient class switching in Xenomouse II was likely 
the result of the inclusion of the entire switch regions, with all of their regulatory elements, as well 
as the additional control elements on yH2, which may be important to support and maintain proper 
class switching. The elevated levels of mature B-cells in Xenomouse II strains are likely to result 
from the higher rearrangement frequency and thus improved B-cell development in the bone marrow 
due to the increased V gene repertoire. B-cell reconstitution is expected to be even more pronounced 
in XenoMouse II strains that are homozygous for the human heavy chain locus. 

The ratio of human κ to mouse λ light chain expression by circulating B-cells provides a 
useful internal measure of the utilization of the transgenic kappa chain locus. Whereas in mice 
containing one allele of smaller Ig YACs an approximately equal distribution of human κ and mouse 
λ was observed, a significant preponderance of human κ was detected in Xenomouse II strains. 
Moreover, animals homozygous for yK2 possessed a κ:λ ratio that is identical to that of wild type mice. 
These observations together with the broad Vκ gene usage strongly suggest that the human proximal 
Vκ genes in the Xenomouse II are sufficient to support a diverse light chain response and are 
consistent with the bias toward proximal Vκ gene usage in humans (Cox et al., 1994). 

Xenomouse II strains exhibited highly increased antibody diversity with V, D, and J genes 
across the entire span of the loci accessed by the recombination mechanism and incorporated into 
mature antibodies. Once triggered by antigen binding, extensive somatic hypermutation occurs, 
leading to affinity maturation of the antibodies. 

The utilization pattern of V, D, J genes in Xenomouse II also indicated they are available and 
utilized in a manner reminiscent of their utilization in humans, yielding an adult-like human antibody 
repertoire, which is different from the fetal-like, position-biased usage observed in Ig 
minigene-bearing mice (Taylor et al., 1992; Taylor et al., 1994; Tuaillon et al., 1993). The broad 
utilization of many of the functional VH and Vκ genes together with the multiplicity of antigens 
recognized by the mice underscores the importance of the large V gene repertoire to successfully 
reconstituting a functional antibody response. 

The ultimate test for the extent of reconstitution of the human immune response in mice is the 
spectrum of antigens to which the mice will elicit an antibody response and the ease with which 
antigen-specific high affinity Mabs can be generated to different antigens. Unlike mice engineered 
with smaller human Ig YACs or minigenes, which yielded to date only a limited number of 
antigen-specific human Mabs (Lonberg et al., 1994; Green et al., 1994; Fishwild et al., 1996), 
Xenomouse II generated Mabs to all human antigens tested to date. Xenomouse II strains mounted 
a strong human antibody response to different human antigens, presented either as soluble proteins 
or expressed on the surfaces of cells. Immunization with each of the three human antigens tested 
yielded a panel of 10-25 antigen-specific human IgG2κ Mabs. For each antigen, a set of antibodies 
with affinities in the range of 10^9-10^10 M^-1 was obtained. Several measures were taken to confirm that 
the affinity values represent univalent binding kinetics rather than avidity: BIAcore assays with intact 
antibodies were carried out with sensor chips coated at low antigen density to minimize the 
probability of bivalent binding; for two antibodies, the assay was repeated with monovalent Fab 
fragments; some of the antibodies were also tested by solution radioimmunoassay. From the results 
of these measurements, we conclude that antibodies with affinities in the range of 10^10 M^-1 are readily 
attainable with the XenoMouse. The affinity values obtained for XenoMouse-derived antibodies are 
the highest to be reported for human antibodies against human antigens produced from either 
engineered mice (Lonberg et al., 1994; Fishwild et al., 1996) or from combinatorial libraries (Vaughan et 
al., 1996). These high affinities combined with the extensive amino acid substitution as a result of 
somatic mutation in the V genes confirm that the mechanism of affinity maturation is intact in 
Xenomouse II and comparable to that in wildtype mice. 

These results show that the large antibody repertoire on the human Ig YACs is being properly 
exploited by the mouse machinery for antibody diversification and selection, and, due to the lack of 
immunological tolerance to human proteins, can yield high affinity antibodies against any antigen of 
interest, including human antigens. The facility with which antibodies to human antigens can be 
generated by the human immunoglobulin genes in these mice provides further confirmation that self 
tolerance at the B-cell level is acquired and not inherited. 

The ability to generate high affinity fully human antibodies to human antigens has obvious 
practical implications. Fully human antibodies are expected to minimize the immunogenic and allergic 
responses intrinsic to mouse or mouse-derivatized Mabs and thus to increase the efficacy and safety 
of the administered antibodies. Xenomouse II offers the opportunity of providing a substantial 
advantage in the treatment of chronic and recurring human diseases, such as inflammation, 
autoimmunity, and cancer, which require repeated antibody administrations. The rapidity and 
reproducibility with which XenoMouse II yields a panel of fully human high affinity antibodies 
indicates the potential advance it offers over other technologies for human antibody production. For 
example, in contrast to phage display, which requires intensive efforts to enhance the affinity of many 
of its derived antibodies and yields single chain Fvs or Fabs, Xenomouse II antibodies are high affinity 
fully intact immunoglobulins which can be produced from hybridomas without further engineering. 

The strategy described here for creation of an authentic human humoral immune system in 
mice can be applied towards humanization of other multi-gene loci, such as the T cell receptor or the 
major histocompatibility complex, that govern other compartments of the mouse immune system 
(Jakobovits, 1994). Such mice would be valuable for elucidating the structure-function relationships 
of the human loci and their involvement in the evolution of the immune system. 

Incorporation By Reference 
All references cited herein, including patents, patent applications, papers, text books, and the 
like, and the references cited therein, to the extent that they are not already, are hereby incorporated 

SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Abgenix, Inc. 

(ii) TITLE OF THE INVENTION: TRANSGENIC MAMMALS HAVING HUMAN IG LOCI 
INCLUDING PLURAL VH AND VK ... 

(iii) NUMBER OF SEQUENCES: 80 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Neave 

(B) STREET: 1251 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: NY 

(E) COUNTRY: USA 

(F) ZIP: 10020 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 03-DEC-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/759,620 

(B) FILING DATE: 03-DEC-1996 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: James, Haley F 

(B) REGISTRATION NUMBER: 27,794 

(C) REFERENCE/DOCKET NUMBER: Cell 4.18 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-596-9000 

(B) TELEFAX: 212-596-9090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 
CAGGTGCAGC TGGAGCAGTC GG 22 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
GCTGAGGGAG TAGAGTCCTG AGGA 24 
(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
TTACTGTGCG AGACA 15 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GGGAGCTACOGG 12 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
GACTACTGGG GC 12 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

TTACTGTGCG AGAGA 15 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
CTTTGACTAC TGGGGC 16 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
TTACTGTACC ACAGA 15 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
ACTAACTACC C 11 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CTACTACTAC TACGGT 16 


(2) INFORMATION FOR SEQ ID NO:11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 
TTACTGTGCG AGAGA 15 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

TAGGAGTGTT 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
GTAGTACCAG CTGCTAT 17 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
ACTACTACTA CTACGGT 17 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
TTACTGTGCG AGAG 14 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
GCAGCAGCTG 10 


(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
CTTTGACTAC TGGGGC 16 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
TTACTGTGCG AGAGA 15 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
GATATTTTGA CTGGT 15 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

CTACTACTAC TACGGT 16 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
TTACTGTGCG AGAGA 15 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
TTTGACTACT GGGGC 15 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

TTACTGTGCG AGAGA 15 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TACTACTACT ACTACGGT 18 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ATTACTGTGC GA 12 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

TATAGCAGTG GCTGGT 16 



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
CTTTGACTAC TGGGGC 16 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
TATTACTGTG CGAG 14 

(2) INFORMATION FOR SEQ ID NO:29: 
(i) SEQUENCE CHARACTERISTICS: 






(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GACTACTGGG GC 12 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
TATTACTGTG CG 12 
(2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
GGATATAGTA GTGG 14 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

CTTTGACTAC TGGGGC 16 

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
TTACTGTGCG AGACA 15 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
ATGCTTTGAT ATCTGGGG 18

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
TTAAACGAAC AGTACCCC 18

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
GATCACCTTC GGCCAA 16 
(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(V) FRAGMENT TYPE: 
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
ACAGGCTAAC AGTTTCCCTC 20 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GGACGTTCGG CCAA 14 
(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
AAGTATAACA GTGCCCC 17 
(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
ATTCACTTTC GGCCCT 16 

(2) INFORMATION FOR SEQ ID NO:41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
ACAGTATGAT AATCTCCC 18

(2) INFORMATION FOR SEQ ID NO:42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
GCTCACTTTC GGCGGA 16 
(2) INFORMATION FOR SEQ ID NO:43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:

AAAGTATAAT AGTTACCC 18

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GATCACCTTC GGCCAA 16

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:
CAGCATAATA GTTACCC 17

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
ATTCACTTTC GGCCCT 16 
(2) INFORMATION FOR SEQ ID NO:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
AATATTATAG TACTCC 16



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
GCTCACTTTC GGCGGA 16

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:
CAGTATGGTA GCTCACCTC 19

(2) INFORMATION FOR SEQ ID NO:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO
(v) FRAGMENT TYPE:
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:

CACTTTTGGCCAG

(2) INFORMATION FOR SEQ ID NO:51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg
1 5 10 15

Ser Thr Ser Thr 



(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Leu Ser Leu Thr Cys Ala Val Tyr Gly Gly Ser Phe Ser Gly Tyr Tyr 
1 5 10 15 

Trp Ser Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly
20 25 30

Glu Ile Asn His Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser
35 40 45

Arg Val Thr Ile Ser Val Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys
50 55 60

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
65 70 75 80

(2) INFORMATION FOR SEQ ID NO:53:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

Leu Ser Leu Thr Cys Ala Val Tyr Gly Gly Ser Phe Ser Gly Tyr Tyr
1 5 10 15

Trp Ser Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly
20 25 30

Glu Ile Asn Gln Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser
35 40 45

Arg Val Ile Ile Ser Ile Asp Thr Ser Lys Thr Gln Phe Ser Leu Lys
50 55 60

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
65 70 75 80

Glu Thr Pro His Ala Phe Asp Ile Trp Gly Gln Gly Thr Met Val Thr
85 90 95

Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro
100 105 110

Cys Ser Arg Ser Thr Ser Thr
115

(2) INFORMATION FOR SEQ ID NO:54:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

Leu Ser Leu Thr Cys Ala Val Tyr Gly Gly Ser Phe Ser Gly Tyr Tyr
1 5 10 15

Trp Thr Trp Ile Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly
20 25 30

Glu Ile Ile His His Gly Asn Thr Asn Tyr Asn Pro Ser Leu Lys Ser
35 40 45

Arg Val Ser Ile Ser Val Asp Thr Ser Lys Asn Gln Phe Ser Leu Thr
50 55 60

Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
65 70 75 80

Gly Gly Ala Val Ala Ala Phe Asp Tyr Trp Gly Gln Gly Thr Leu Val
85 90 95

Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala
100 105 110

Pro Cys Ser Arg Ser Thr Ser Thr
115 120

(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

Ser His His Leu Lys Ile Ser Cys Lys Gly Ser Gly Tyr Ser Phe Thr
1 5 10 15

Ser Tyr Trp Ile Gly Trp Val Arg Gln Met Pro Gly Lys Gly Leu Glu
20 25 30

Trp Met Gly Ile Ile Tyr Pro Gly Asp Ser Asp Thr Arg Tyr Ser Pro
35 40 45

Ser Phe Gln Gly Gln Val Thr Ile Ser Ala Asp Lys Ser Ile Ser Thr
50 55 60

Ala Tyr Leu Gln Trp Ser Ser Leu Lys Ala Ser Asp Thr Ala Met Tyr
65 70 75 80

Tyr Cys Ala Arg

(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 






Ser Leu Lys Ile Ser Cys Lys Gly Ser Gly Tyr Ser Phe Thr Ser Tyr
1 5 10 15

Trp Ile Gly Trp Val Arg Gln Met Pro Gly Lys Gly Leu Glu Trp Met
20 25 30

Gly Ile Ile Tyr Pro Gly Asp Ser Asp Thr Arg Tyr Ser Pro Ser Phe
35 40 45

Gln Gly Gln Val Thr Ile Ser Ala Asp Lys Ser Ile Ser Thr Ala Tyr
50 55 60

Leu Gln Trp Ser Ser Leu Lys Ala Ser Asp Thr Ala Met Tyr Tyr Cys
65 70 75 80

Ala Arg Gln Asp Gly Asp Ser Phe Asp Tyr Trp Gly Gln Gly Thr Leu
85 90 95

Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
100 105 110

Ala Pro Cys Ser Arg Ser Thr Ser Thr
115 120

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:

Arg Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser
1 5 10 15

Tyr Gly Met His Trp Xaa Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
20 25 30

Val Ala Val Ile Ser Tyr Asp Gly Ser Asn Lys Tyr Tyr Ala Asp Ser
35 40 45

Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu
50 55 60

Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
65 70 75 80

Cys Ala Arg



(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Arg Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser
1 5 10 15

Tyr Gly Met His Trp Xaa Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp
20 25 30

Val Ala Glu Ile Ser Tyr Asp Gly Ser Asn Lys Tyr Tyr Val Asp Ser
35 40 45

Val Lys Gly Arg Leu Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu
50 55 60

Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr
65 70 75 80

Cys Ala Arg Asp Arg Leu Gly Ile Phe Asp Tyr Trp Gly Gln Gly Thr
85 90 95

Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro
100 105 110

Leu Ala Pro Cys Ser Arg Ser Thr Ser Thr
115 120

(2) INFORMATION FOR SEQ ID NO:59:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu
1 5 10 15

Gln

(2) INFORMATION FOR SEQ ID NO:60:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO
(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr Leu Asn Trp
1 5 10 15

Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Asp Ala
20 25 30

Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu Asp Ile
50 55 60

Ala Thr Tyr Tyr Cys Gln Gln Asp Asn Leu Pro
65 70 75

(2) INFORMATION FOR SEQ ID NO:61:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Lys Phe Leu Ser Trp
1 5 10 15

Phe Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Gly Thr
20 25 30

Ser Tyr Leu Glu Thr Gly Val Pro Ser Ser Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Val
50 55 60

Ala Thr Tyr Phe Cys Gln Gln Asp Asp Leu Pro Tyr Thr Phe Gly Pro
65 70 75 80

Gly Thr Lys Val Asp Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe
85 90 95

Ile Phe Pro Pro Ser Asp Glu Gln
100

(2) INFORMATION FOR SEQ ID NO:62:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr Leu Asn Trp
1 5 10 15

Tyr Gln Gln Lys Ala Gly Lys Ala Pro Lys Val Leu Ile Tyr Ala Ala
20 25 30

Ser Asn Leu Glu Ala Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu Asp Ile
50 55 60

Ala Thr Tyr Tyr Cys His Gln Asp Asn Leu Pro Leu Thr Phe Gly Gly
65 70 75 80

Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe
85 90 95

Ile Phe Pro Pro Ser Asp Glu Gln



(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 74 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

Thr Cys Arg Ala Ser Gln Ser Ile Ser Ser Tyr Leu Asn Trp Tyr Gln
1 5 10 15

Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Ala Ala Ser Ser
20 25 30

Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr
35 40 45

Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr
50 55 60

Tyr Tyr Cys Gln Gln Ser Tyr Ser Thr Pro
65 70

(2) INFORMATION FOR SEQ ID NO:64:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

Thr Cys Arg Ala Ser Gln Ser Ile Ser Asn Tyr Leu Asn Trp Tyr Gln
1 5 10 15

Gln Lys Pro Gly Lys Ala Pro Lys Phe Leu Ile Tyr Gly Ala Ser Ser
20 25 30

Leu Glu Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr
35 40 45

Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr
50 55 60

Tyr Tyr Cys Gln Gln Ser Tyr Ser Asn Pro Leu Thr Phe Gly Gly Gly
65 70 75 80

Thr Lys Val Glu Ile Lys
85

(2) INFORMATION FOR SEQ ID NO:65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

Thr Ile Asn Cys Lys Ser Ser Gln Ser Val Leu Tyr Ser Ser Asn Asn
1 5 10 15

Lys Asn Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln Pro Pro Lys
20 25 30

Leu Leu Ile Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val Pro Asp Arg
35 40 45

Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
50 55 60

Leu Gln Ala Glu Asp Val Ala Val Tyr Tyr Cys Gln Gln Tyr Tyr Ser
65 70 75 80

Thr Pro



(2) INFORMATION FOR SEQ ID NO:66:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Thr Ile Asn Cys Lys Ser Ser Gln Ser Val Leu Tyr Ile Ser Asn Asn
1 5 10 15

Lys Asn Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ser Pro Lys
20 25 30

Leu Leu Ile Tyr Trp Ala Ser Thr Arg Lys Ser Gly Val Pro Asp Arg
35 40 45

Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
50 55 60

Leu Gln Ala Glu Asp Val Ala Val Tyr Tyr Cys Gln Gln Tyr Tyr Asp
65 70 75 80

Thr Pro Phe Thr Phe Gly Pro Gly Thr Lys Val Asp Ile Lys
85 90

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg 
1 5 10 15 

Ser Thr Ser Thr 



(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:

Val Ser Gly Gly Ser Ile Ser Ser Gly Gly Tyr Tyr Trp Ser Trp Ile
1 5 10 15

Arg Gln His Pro Gly Lys Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr
20 25 30

Ser Gly Ser Thr Tyr Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr Ile
35 40 45

Ser Val Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys Leu Ser Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
65 70 75

(2) INFORMATION FOR SEQ ID NO:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

Val Ser Gly Gly Ser Ile Asn Ser Gly Asp Tyr Tyr Trp Ser Trp Ile
1 5 10 15

Arg Gln His Pro Gly Lys Gly Leu Asp Cys Ile Gly Tyr Ile Tyr Tyr
20 25 30

Ser Gly Ser Thr Tyr Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr Ile
35 40 45

Ser Val Asp Thr Ser Lys Asn Gln Phe Phe Leu Lys Leu Thr Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Ser Thr Val Val
65 70 75 80

Asn Pro Gly Trp Phe Asp Pro Trp Gly Gln Gly Tyr Leu Val Thr Val
85 90 95

Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys
100 105 110

Ser Arg Ser Thr Ser Thr
115

(2) INFORMATION FOR SEQ ID NO:70:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:

Val Ser Gly Gly Ser Ile Asn Ser Gly Asp Tyr Tyr Trp Ser Trp Ile
1 5 10 15

Arg Gln His Pro Gly Lys Gly Leu Glu Trp Ile Gly Ser Ile Tyr Tyr
20 25 30

Ser Gly Asn Thr Phe Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr Ile
35 40 45

Ser Leu Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys Leu Ser Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Cys Tyr Cys Ala Arg Asn Ile Val Thr
65 70 75 80

Thr Gly Ala Phe Asp Ile Trp Gly Gln Gly Thr Met Val Thr Val Ser
85 90 95

Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser
100 105 110

Arg Ser Thr Ser Thr
115

(2) INFORMATION FOR SEQ ID NO:71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 116 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:

Val Ser Gly Gly Ser Ile Ser Ser Gly Asp Tyr Tyr Trp Thr Trp Ile
1 5 10 15

Arg Gln His Pro Gly Lys Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr
20 25 30

Ser Gly Asn Thr Tyr Tyr Asn Pro Ser Leu Lys Ser Arg Val Ser Met
35 40 45

Ser Ile Asp Thr Ser Glu Asn Gln Phe Ser Leu Lys Leu Ser Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Lys Pro Val Thr
65 70 75 80

Gly Gly Glu Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser
85 90 95

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser Arg
100 105 110

Ser Thr Ser Thr
115

(2) INFORMATION FOR SEQ ID NO:72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal

(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:

Val Ser Gly Gly Ser Val Ser Ser Gly Ser Tyr Tyr Trp Ser Trp Ile
1 5 10 15

Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr
20 25 30

Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr Ile
35 40 45

Ser Val Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys Leu Ser Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
65 70 75

(2) INFORMATION FOR SEQ ID NO:73:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:

Val Ser Gly Gly Ser Val Ser Ser Gly Ser Tyr Tyr Trp Ser Trp Ile
1 5 10 15

Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly Tyr Ile Tyr Tyr
20 25 30

Ser Gly Ser Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr Ile
35 40 45

Ser Val Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys Leu Ser Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg
65 70 75

(2) INFORMATION FOR SEQ ID NO:74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

Val Ser Gly Gly Ser Val Ser Ser Gly Asp Tyr Tyr Trp Ser Trp Ile
1 5 10 15

Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly His Leu Tyr Tyr
20 25 30

Ser Gly Asn Thr Asn Tyr Asn Pro Ser Leu Lys Ser Arg Val Thr Ile
35 40 45

Ser Leu Asp Thr Ser Lys Asn Gln Phe Ser Leu Lys Leu Ser Ser Val
50 55 60

Thr Ala Ala Asp Thr Ala Val Tyr Tyr Cys Ala Arg Asp Phe Leu Thr
65 70 75 80

Gly Ser Phe Phe Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser
85 90 95

Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Cys Ser
100 105 110

Arg Ser Thr Ser Thr
115

(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu
1 5 10 15

Gln

(2) INFORMATION FOR SEQ ID NO:76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr Leu Asn Trp
1 5 10 15

Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Asp Ala
20 25 30

Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu Asp Ile
50 55 60

Ala Thr Tyr Tyr Cys Gln Gln Asp Asn Leu Pro
65 70 75

(2) INFORMATION FOR SEQ ID NO:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Asn Asn Tyr Leu Asn Trp
1 5 10 15

Phe Gln Gln Lys Pro Gly Lys Ala Pro Lys Val Leu Ile His Asp Ala
20 25 30

Ser Asn Leu Glu Thr Gly Gly Pro Ser Arg Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Gly Leu Gln Pro Glu Asp Ile
50 55 60

Ala Thr Tyr Tyr Cys Gln Gln Glu Ser Leu Pro Leu Thr Phe Gly Gly
65 70 75 80

Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe
85 90 95

Ile Phe Pro Pro Ser Asp Glu Gln
100

(2) INFORMATION FOR SEQ ID NO:78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Thr Ile Tyr Leu Asn Trp
1 5 10 15

Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Asn Asp Ala
20 25 30

Ser Ser Leu Glu Thr Gly Val Pro Leu Arg Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu Asp Ile
50 55 60

Ala Thr Tyr Tyr Cys Gln Gln Asp His Leu Pro Leu Thr Phe Gly Gly
65 70 75 80

Gly Thr Lys Val Ala Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe
85 90 95

Ile Phe Pro Pro Ser Asp Glu Gln
100

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr Leu Asn Trp
1 5 10 15

Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Asp Ala
20 25 30

Ser Asn Leu Glu Thr Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Ser Leu Gln Pro Glu Asp Val
50 55 60

Gly Thr Tyr Tyr Val Gln Gln Glu Ser Leu Pro Cys Gly Phe Gly Gln
65 70 75 80

Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe
85 90 95

Ile Phe Pro Pro Ser Asp Glu Gln
100

(2) INFORMATION FOR SEQ ID NO:80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal
(vi) ORIGINAL SOURCE:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:

Thr Ile Thr Cys Gln Ala Ser Gln Asp Ile Ser Asn Tyr Leu Asn Trp
1 5 10 15

Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Asn Asp Ala
20 25 30

Ser Asp Leu Glu Thr Gly Val Pro Ser Arg Ile Ser Gly Ser Gly Ser
35 40 45

Gly Thr Asp Phe Thr Phe Thr Ile Ser Asn Leu Gln Pro Glu Asp Ile
50 55 60

Ala Thr Tyr Tyr Cys Gln Gln Asp Ser Leu Pro Leu Thr Phe Gly Gly
65 70 75 80

Gly Thr Lys Val Glu Ile Arg Arg Thr Val Ala Ala Pro Ser Val Phe
85 90 95

Ile Phe Pro Pro Ser Asp Glu Gln
100
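
Each entry in the sequence listing above pairs a declared LENGTH with the sequence itself, so the two can be cross-checked mechanically. The following is a minimal sketch of such a check, assuming nucleotide sequences use only the symbols A, C, G and T (possibly split into space-separated groups) and peptide sequences use standard three-letter residue codes plus Xaa. The function names are hypothetical and are not part of the application or of any sequence-listing tool.

```python
# Illustrative consistency check for sequence-listing entries.
# All names here are hypothetical helpers, not an official tool.

STANDARD_CODES = {
    "Ala", "Arg", "Asn", "Asp", "Cys", "Gln", "Glu", "Gly", "His", "Ile",
    "Leu", "Lys", "Met", "Phe", "Pro", "Ser", "Thr", "Trp", "Tyr", "Val",
    "Xaa",  # Xaa marks an unspecified residue
}


def nucleotide_length(sequence: str) -> int:
    """Count bases, ignoring the spaces that split a sequence into groups."""
    bases = "".join(sequence.split()).upper()
    invalid = sorted(set(bases) - set("ACGT"))
    if invalid:
        raise ValueError(f"unexpected base symbols: {invalid}")
    return len(bases)


def peptide_length(sequence: str) -> int:
    """Count residues written as whitespace-separated three-letter codes."""
    residues = sequence.split()
    unknown = [r for r in residues if r not in STANDARD_CODES]
    if unknown:
        raise ValueError(f"unrecognized residue codes: {unknown}")
    return len(residues)


# SEQ ID NO:29 is declared as 12 base pairs.
seq_29 = "GACTACTGGGGC"
print(nucleotide_length(seq_29) == 12)  # True

# SEQ ID NO:59 is declared as 17 amino acids.
seq_59 = "Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln"
print(peptide_length(seq_59) == 17)  # True
```

A residue code outside the standard set raises an error rather than being counted, which makes character-recognition damage in a transcribed listing easy to spot.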
CLAIMS

WHAT IS CLAIMED IS:

1. A transgenic non-human mammal having a genome, the genome comprising
modifications, the modifications comprising:

an inactivated endogenous immunoglobulin (Ig) locus, such that the mammal would 
not display normal B-cell development; 

an inserted human heavy chain Ig locus in substantially germline configuration, the 
human heavy chain Ig locus comprising a human mu constant region and regulatory and 
switch sequences thereto, a plurality of human Jh genes, a plurality of human Dh genes, and
a plurality of human Vh genes; and

an inserted human kappa light chain Ig locus in substantially germline configuration, 
the human kappa light chain Ig locus comprising a human kappa constant region, a plurality 
of Jk genes, and a plurality of Vk genes, 

wherein the number of Vh and Vk genes inserted are selected to substantially restore normal
B-cell development in the mammal. 

2. The mammal of Claim 1, wherein the heavy chain Ig locus comprises a second 
constant region selected from the group consisting of human gamma-1, human gamma-2, human
gamma-3, human gamma-4, alpha, epsilon, and delta. 

3. The mammal of Claim 1, wherein the number of Vh genes is greater than about 20.

4. The mammal of Claim 1, wherein the number of Vk genes is greater than about 15.

5. The mammal of Claim 1, wherein the number of Dh genes is greater than about 25,
the number of Jh genes is greater than about 4, the number of Vh genes is greater than about 20, the
number of Jk genes is greater than about 4, and the number of Vk genes is greater than about 15.

6. The mammal of Claim 1, wherein the number of Dh genes, the number of Jh genes,
the number of Vh genes, the number of Jk genes, and the number of Vk genes are selected such that
the Ig loci are capable of encoding greater than about 1x10* different functional antibody sequence
combinations. 
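
As an illustration only, and not as part of the claims, the number of sequence combinations implied by a given set of gene-segment counts can be sketched as a simple product. The sketch assumes that each heavy chain uses one Vh, one Dh and one Jh segment, that each kappa light chain uses one Vk and one Jk segment, and that any heavy chain can pair with any light chain; junctional diversity and somatic mutation are ignored. The counts are the lower bounds recited in Claim 5, and the names are hypothetical.

```python
# Illustrative arithmetic only; names are hypothetical.
# Counts are the "greater than about" lower bounds recited in Claim 5.
CLAIM_5_LOWER_BOUNDS = {"Vh": 20, "Dh": 25, "Jh": 4, "Vk": 15, "Jk": 4}


def combinatorial_diversity(counts: dict) -> int:
    """Multiply segment counts, assuming free V-D-J and V-J joining and
    free heavy/light pairing, with no junctional or somatic diversity."""
    heavy = counts["Vh"] * counts["Dh"] * counts["Jh"]  # heavy-chain V-D-J
    light = counts["Vk"] * counts["Jk"]                 # kappa-chain V-J
    return heavy * light


print(combinatorial_diversity(CLAIM_5_LOWER_BOUNDS))  # 120000
```

Under these assumptions the Claim 5 lower bounds give 20 x 25 x 4 = 2000 heavy-chain combinations and 15 x 4 = 60 light-chain combinations, or 120,000 pairings in total.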

7. The mammal of Claim 1, wherein in a population of mammals B-cell function is 
reconstituted on average to greater than about 50% as compared to wild type. 

8. In a transgenic non-human mammal having a genome that comprises modifications, 
the modifications rendering the mammal capable of producing human immunoglobulin molecules but 
substantially incapable of producing functional endogenous antibody molecules, the improvement 
comprising: 

insertion into the genome of the mammal of sufficient human Vh, Dh, Jh, Vk, and Jk 
genes such that the mammal is capable of encoding greater than about 1 x 10^ different 
functional human immunoglobulin sequence combinations, without accounting for junctional 
diversity or somatic mutation events. 

9. In a transgenic non-human mammal having a genome that comprises modifications, 
the modifications rendering the mammal capable of producing human immunoglobulin molecules but 
substantially incapable of producing functional endogenous antibody molecules, which modifications, 
with respect to the mammal's incapacity to produce functional endogenous antibody molecules, would 
not allow the mammal to display normal B-cell development, the improvement comprising: 

insertion into the genome of the mammal of sufficient human Vh, Dh, Jh, Vk, and Jk 
genes such that the mammal is capable of encoding greater than about 1x10^ different 
functional human immunoglobulin sequence combinations and sufficient Vh and Vk genes to 
substantially restore normal B-cell development in the mammal. 

10. In the mammal of Claim 9, wherein in a population of mammals B-cell function is 
reconstituted on average to greater than about 50% as compared to wild type. 

11. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 
an inactivated endogenous kappa light chain Ig locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

12. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

13. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 
an inactivated endogenous kappa light chain Ig locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2 without 
the presence of a human gamma-2 constant region; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

14. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 
an inactivated endogenous kappa light chain Ig locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2 without 
the presence of a human gamma-2 constant region; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

15. A transgenic non-human mammal having a genome, the genome comprising 
modifications, the modifications comprising: 

an inactivated endogenous heavy chain immunoglobulin (Ig) locus; 

an inserted human heavy chain Ig locus, the human heavy chain Ig locus comprising 
a nucleotide sequence substantially corresponding to the nucleotide sequence of yH2 without 
the presence of a human gamma-2 constant region; and 

an inserted human kappa light chain Ig locus, the human kappa light chain Ig locus 
comprising a nucleotide sequence substantially corresponding to the nucleotide sequence of 
yK2. 

16. A method for the production of human antibodies, comprising: 
inoculating a mammal of any one of Claims 1-10 with an antigen; 

collecting and immortalizing lymphocytic cells to obtain immortal cell lines secreting 
human antibodies that specifically bind to the antigen with an affinity of greater than 10^ M⁻¹; 
and 

isolating the antibodies from the immortal cell lines. 

17. The method of Claim 11, wherein the antigen is IL-8. 

18. The method of Claim 11, wherein the antigen is EGFR. 

19. The method of Claim 11, wherein the antigen is TNF-α. 

20. An antibody produced by the method of Claim 11. 

21. An anti-IL-8 antibody produced by the method of Claim 12. 

22. An anti-EGFR antibody produced by the method of Claim 13. 

23. An anti-TNF-α antibody produced by the method of Claim 14. 

24. In a method for the production of transgenic mice, the transgenic mice having a 
genome, the genome comprising modifications, the modifications comprising insertion of a plurality 
of human variable regions, the improvement comprising: 

insertion of the human variable regions from a yeast artificial chromosome. 

25. Transgenic mice and transgenic offspring therefrom produced through use of the 
improvement of Claim 19. 

26. In a transgenic mammal, the transgenic mammal comprising a genome, the genome 
comprising modifications, the modifications comprising an inserted human heavy chain 
immunoglobulin transgene, the improvement comprising: 

the transgene comprising selected sets of human variable region genes that enable 
human-like junctional diversity and human-like complementarity determining region 3 (CDR3) 
lengths. 

27. In the improvement of Claim 26, wherein the human-like junctional diversity comprises 
average N-addition lengths of 7.7 bases. 

28. In the improvement of Claim 26, wherein the human-like CDR3 lengths comprise 
between about 2 through about 25 residues with an average of about 14. 